Dataset statistics
| Number of variables | 42 |
|---|---|
| Number of observations | 1083397 |
| Missing cells | 11154479 |
| Missing cells (%) | 24.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.0 GiB |
| Average record size in memory | 1.9 KiB |
Variable types
| Categorical | 22 |
|---|---|
| Numeric | 17 |
| Boolean | 3 |
restaurant_link has a high cardinality: 1083397 distinct values | High cardinality |
restaurant_name has a high cardinality: 840914 distinct values | High cardinality |
original_location has a high cardinality: 65997 distinct values | High cardinality |
region has a high cardinality: 250 distinct values | High cardinality |
province has a high cardinality: 1333 distinct values | High cardinality |
city has a high cardinality: 43495 distinct values | High cardinality |
address has a high cardinality: 1034685 distinct values | High cardinality |
awards has a high cardinality: 917 distinct values | High cardinality |
popularity_detailed has a high cardinality: 981409 distinct values | High cardinality |
popularity_generic has a high cardinality: 981940 distinct values | High cardinality |
top_tags has a high cardinality: 39962 distinct values | High cardinality |
price_range has a high cardinality: 7298 distinct values | High cardinality |
meals has a high cardinality: 745 distinct values | High cardinality |
cuisines has a high cardinality: 97741 distinct values | High cardinality |
special_diets has a high cardinality: 68 distinct values | High cardinality |
features has a high cardinality: 56453 distinct values | High cardinality |
original_open_hours has a high cardinality: 237890 distinct values | High cardinality |
keywords has a high cardinality: 99001 distinct values | High cardinality |
open_days_per_week is highly correlated with open_hours_per_week | High correlation |
open_hours_per_week is highly correlated with open_days_per_week | High correlation |
avg_rating is highly correlated with food and 3 other fields | High correlation |
total_reviews_count is highly correlated with reviews_count_in_default_language and 3 other fields | High correlation |
reviews_count_in_default_language is highly correlated with total_reviews_count and 5 other fields | High correlation |
excellent is highly correlated with total_reviews_count and 5 other fields | High correlation |
very_good is highly correlated with total_reviews_count and 5 other fields | High correlation |
average is highly correlated with total_reviews_count and 5 other fields | High correlation |
poor is highly correlated with reviews_count_in_default_language and 4 other fields | High correlation |
terrible is highly correlated with reviews_count_in_default_language and 4 other fields | High correlation |
food is highly correlated with avg_rating and 3 other fields | High correlation |
service is highly correlated with avg_rating and 3 other fields | High correlation |
value is highly correlated with avg_rating and 3 other fields | High correlation |
atmosphere is highly correlated with avg_rating and 3 other fields | High correlation |
open_days_per_week is highly correlated with open_hours_per_week | High correlation |
open_hours_per_week is highly correlated with open_days_per_week | High correlation |
avg_rating is highly correlated with food and 3 other fields | High correlation |
total_reviews_count is highly correlated with reviews_count_in_default_language and 4 other fields | High correlation |
reviews_count_in_default_language is highly correlated with total_reviews_count and 5 other fields | High correlation |
excellent is highly correlated with total_reviews_count and 4 other fields | High correlation |
very_good is highly correlated with total_reviews_count and 5 other fields | High correlation |
average is highly correlated with total_reviews_count and 5 other fields | High correlation |
poor is highly correlated with total_reviews_count and 5 other fields | High correlation |
terrible is highly correlated with reviews_count_in_default_language and 3 other fields | High correlation |
food is highly correlated with avg_rating and 3 other fields | High correlation |
service is highly correlated with avg_rating and 3 other fields | High correlation |
value is highly correlated with avg_rating and 3 other fields | High correlation |
atmosphere is highly correlated with avg_rating and 3 other fields | High correlation |
avg_rating is highly correlated with food and 3 other fields | High correlation |
total_reviews_count is highly correlated with reviews_count_in_default_language | High correlation |
reviews_count_in_default_language is highly correlated with total_reviews_count and 5 other fields | High correlation |
excellent is highly correlated with reviews_count_in_default_language and 1 other fields | High correlation |
very_good is highly correlated with reviews_count_in_default_language and 4 other fields | High correlation |
average is highly correlated with reviews_count_in_default_language and 3 other fields | High correlation |
poor is highly correlated with reviews_count_in_default_language and 3 other fields | High correlation |
terrible is highly correlated with reviews_count_in_default_language and 3 other fields | High correlation |
food is highly correlated with avg_rating and 3 other fields | High correlation |
service is highly correlated with avg_rating and 3 other fields | High correlation |
value is highly correlated with avg_rating and 2 other fields | High correlation |
atmosphere is highly correlated with avg_rating and 2 other fields | High correlation |
gluten_free is highly correlated with vegan_options and 1 other fields | High correlation |
vegan_options is highly correlated with gluten_free and 2 other fields | High correlation |
vegetarian_friendly is highly correlated with vegan_options and 1 other fields | High correlation |
special_diets is highly correlated with gluten_free and 2 other fields | High correlation |
country is highly correlated with latitude and 1 other fields | High correlation |
latitude is highly correlated with country and 1 other fields | High correlation |
longitude is highly correlated with country and 1 other fields | High correlation |
claimed is highly correlated with vegetarian_friendly | High correlation |
special_diets is highly correlated with vegetarian_friendly and 2 other fields | High correlation |
vegetarian_friendly is highly correlated with claimed and 4 other fields | High correlation |
vegan_options is highly correlated with special_diets and 2 other fields | High correlation |
gluten_free is highly correlated with special_diets and 2 other fields | High correlation |
open_days_per_week is highly correlated with open_hours_per_week and 1 other fields | High correlation |
open_hours_per_week is highly correlated with open_days_per_week and 1 other fields | High correlation |
working_shifts_per_week is highly correlated with open_days_per_week and 1 other fields | High correlation |
avg_rating is highly correlated with food and 3 other fields | High correlation |
total_reviews_count is highly correlated with reviews_count_in_default_language and 3 other fields | High correlation |
default_language is highly correlated with vegetarian_friendly | High correlation |
reviews_count_in_default_language is highly correlated with total_reviews_count and 5 other fields | High correlation |
excellent is highly correlated with total_reviews_count and 5 other fields | High correlation |
very_good is highly correlated with total_reviews_count and 5 other fields | High correlation |
average is highly correlated with total_reviews_count and 5 other fields | High correlation |
poor is highly correlated with reviews_count_in_default_language and 4 other fields | High correlation |
terrible is highly correlated with reviews_count_in_default_language and 4 other fields | High correlation |
food is highly correlated with avg_rating and 3 other fields | High correlation |
service is highly correlated with avg_rating and 3 other fields | High correlation |
value is highly correlated with avg_rating and 3 other fields | High correlation |
atmosphere is highly correlated with avg_rating and 3 other fields | High correlation |
region has 50323 (4.6%) missing values | Missing |
province has 340632 (31.4%) missing values | Missing |
city has 400685 (37.0%) missing values | Missing |
latitude has 15790 (1.5%) missing values | Missing |
longitude has 15790 (1.5%) missing values | Missing |
awards has 820264 (75.7%) missing values | Missing |
popularity_detailed has 94988 (8.8%) missing values | Missing |
popularity_generic has 97792 (9.0%) missing values | Missing |
top_tags has 110634 (10.2%) missing values | Missing |
price_level has 277205 (25.6%) missing values | Missing |
price_range has 779070 (71.9%) missing values | Missing |
meals has 448050 (41.4%) missing values | Missing |
cuisines has 169103 (15.6%) missing values | Missing |
special_diets has 743141 (68.6%) missing values | Missing |
features has 765990 (70.7%) missing values | Missing |
original_open_hours has 489565 (45.2%) missing values | Missing |
open_days_per_week has 489565 (45.2%) missing values | Missing |
open_hours_per_week has 489565 (45.2%) missing values | Missing |
working_shifts_per_week has 489565 (45.2%) missing values | Missing |
avg_rating has 96636 (8.9%) missing values | Missing |
total_reviews_count has 52235 (4.8%) missing values | Missing |
default_language has 95193 (8.8%) missing values | Missing |
reviews_count_in_default_language has 95193 (8.8%) missing values | Missing |
excellent has 95193 (8.8%) missing values | Missing |
very_good has 95193 (8.8%) missing values | Missing |
average has 95193 (8.8%) missing values | Missing |
poor has 95193 (8.8%) missing values | Missing |
terrible has 95193 (8.8%) missing values | Missing |
food has 484072 (44.7%) missing values | Missing |
service has 479110 (44.2%) missing values | Missing |
value has 480705 (44.4%) missing values | Missing |
atmosphere has 821612 (75.8%) missing values | Missing |
keywords has 984199 (90.8%) missing values | Missing |
total_reviews_count is highly skewed (γ1 = 25.28240244) | Skewed |
average is highly skewed (γ1 = 21.42675175) | Skewed |
restaurant_link is uniformly distributed | Uniform |
address is uniformly distributed | Uniform |
popularity_detailed is uniformly distributed | Uniform |
popularity_generic is uniformly distributed | Uniform |
keywords is uniformly distributed | Uniform |
restaurant_link has unique values | Unique |
total_reviews_count has 44149 (4.1%) zeros | Zeros |
excellent has 146592 (13.5%) zeros | Zeros |
very_good has 278879 (25.7%) zeros | Zeros |
average has 493840 (45.6%) zeros | Zeros |
poor has 614652 (56.7%) zeros | Zeros |
terrible has 573943 (53.0%) zeros | Zeros |
Reproduction
| Analysis started | 2021-12-08 05:53:21.849890 |
|---|---|
| Analysis finished | 2021-12-08 06:05:07.011385 |
| Duration | 11 minutes and 45.16 seconds |
| Software version | pandas-profiling v3.1.1 |
| Download configuration | config.json |
| Distinct | 1083397 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 76.2 MiB |
| g4173422-d19235050 | 1 |
|---|---|
| g7377416-d2002991 | 1 |
| g1073586-d14939977 | 1 |
| g187791-d15780017 | 1 |
| g187803-d12133958 | 1 |
| Other values (1083392) |
Length
| Max length | 19 |
|---|---|
| Median length | 17 |
| Mean length | 16.73728282 |
| Min length | 15 |
Characters and Unicode
| Total characters | 18133122 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1083397 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | g10001637-d10002227 |
|---|---|
| 2nd row | g10001637-d14975787 |
| 3rd row | g10002858-d4586832 |
| 4th row | g10002986-d3510044 |
| 5th row | g10022428-d9767191 |
Common Values
| Value | Count | Frequency (%) |
| g4173422-d19235050 | 1 | < 0.1% |
| g7377416-d2002991 | 1 | < 0.1% |
| g1073586-d14939977 | 1 | < 0.1% |
| g187791-d15780017 | 1 | < 0.1% |
| g187803-d12133958 | 1 | < 0.1% |
| g187458-d21360473 | 1 | < 0.1% |
| g1547037-d4919342 | 1 | < 0.1% |
| g504128-d10146600 | 1 | < 0.1% |
| g3320396-d3627425 | 1 | < 0.1% |
| g804273-d19082912 | 1 | < 0.1% |
| Other values (1083387) | 1083387 |
Length
| Value | Count | Frequency (%) |
| g4173422-d19235050 | 1 | < 0.1% |
| g187391-d3241304 | 1 | < 0.1% |
| g316018-d1978381 | 1 | < 0.1% |
| g8429732-d8342611 | 1 | < 0.1% |
| g503723-d20941365 | 1 | < 0.1% |
| g274914-d2289055 | 1 | < 0.1% |
| g186338-d15704010 | 1 | < 0.1% |
| g503808-d21068175 | 1 | < 0.1% |
| g2429250-d11920017 | 1 | < 0.1% |
| g9595282-d12446977 | 1 | < 0.1% |
| Other values (1083387) | 1083387 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2367706 | |
| 8 | 1676353 | |
| 7 | 1560090 | |
| 2 | 1446741 | |
| 4 | 1346138 | |
| 6 | 1335721 | |
| 3 | 1318883 | |
| 5 | 1281134 | 7.1% |
| 0 | 1276116 | 7.0% |
| 9 | 1274049 | 7.0% |
| Other values (3) | 3250191 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14882931 | |
| Lowercase Letter | 2166794 | 11.9% |
| Dash Punctuation | 1083397 | 6.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2367706 | |
| 8 | 1676353 | |
| 7 | 1560090 | |
| 2 | 1446741 | |
| 4 | 1346138 | |
| 6 | 1335721 | |
| 3 | 1318883 | |
| 5 | 1281134 | |
| 0 | 1276116 | |
| 9 | 1274049 |
Lowercase Letter
| Value | Count | Frequency (%) |
| g | 1083397 | |
| d | 1083397 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1083397 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15966328 | |
| Latin | 2166794 | 11.9% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2367706 | |
| 8 | 1676353 | |
| 7 | 1560090 | |
| 2 | 1446741 | |
| 4 | 1346138 | |
| 6 | 1335721 | |
| 3 | 1318883 | |
| 5 | 1281134 | |
| 0 | 1276116 | |
| 9 | 1274049 |
Latin
| Value | Count | Frequency (%) |
| g | 1083397 | |
| d | 1083397 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18133122 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2367706 | |
| 8 | 1676353 | |
| 7 | 1560090 | |
| 2 | 1446741 | |
| 4 | 1346138 | |
| 6 | 1335721 | |
| 3 | 1318883 | |
| 5 | 1281134 | 7.1% |
| 0 | 1276116 | 7.0% |
| 9 | 1274049 | 7.0% |
| Other values (3) | 3250191 |
| Distinct | 840914 |
|---|---|
| Distinct (%) | 77.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 77.3 MiB |
| Subway | 4881 |
|---|---|
| McDonald's | 4458 |
| Burger King | 2480 |
| Domino's Pizza | 2163 |
| KFC | 1501 |
| Other values (840909) |
Length
| Max length | 105 |
|---|---|
| Median length | 15 |
| Mean length | 16.36735933 |
| Min length | 1 |
Characters and Unicode
| Total characters | 17732348 |
|---|---|
| Distinct characters | 525 |
| Distinct categories | 24 ? |
| Distinct scripts | 10 ? |
| Distinct blocks | 20 ? |
Unique
| Unique | 768249 ? |
|---|---|
| Unique (%) | 70.9% |
Sample
| 1st row | Le 147 |
|---|---|
| 2nd row | Le Saint Jouvent |
| 3rd row | Au Bout du Pont |
| 4th row | Le Relais de Naiade |
| 5th row | Relais Du MontSeigne |
Common Values
| Value | Count | Frequency (%) |
| Subway | 4881 | 0.5% |
| McDonald's | 4458 | 0.4% |
| Burger King | 2480 | 0.2% |
| Domino's Pizza | 2163 | 0.2% |
| KFC | 1501 | 0.1% |
| Costa Coffee | 1367 | 0.1% |
| Starbucks | 1119 | 0.1% |
| Pizza Hut | 992 | 0.1% |
| Wild Bean Cafe | 920 | 0.1% |
| BP | 629 | 0.1% |
| Other values (840904) | 1062887 |
Length
| Value | Count | Frequency (%) |
| la | 80549 | 2.9% |
| restaurant | 68041 | 2.4% |
| 63134 | 2.2% | |
| bar | 59221 | 2.1% |
| the | 46634 | 1.7% |
| cafe | 45343 | 1.6% |
| le | 38849 | 1.4% |
| pizzeria | 35085 | 1.2% |
| de | 33968 | 1.2% |
| ristorante | 29268 | 1.0% |
| Other values (342372) | 2322299 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1949331 | 11.0% |
| 1741579 | 9.8% | |
| e | 1614156 | 9.1% |
| r | 1122456 | 6.3% |
| i | 1111677 | 6.3% |
| o | 971826 | 5.5% |
| t | 900478 | 5.1% |
| n | 874322 | 4.9% |
| s | 784395 | 4.4% |
| l | 641495 | 3.6% |
| Other values (515) | 6020633 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13003878 | |
| Uppercase Letter | 2723843 | 15.4% |
| Space Separator | 1741693 | 9.8% |
| Other Punctuation | 156647 | 0.9% |
| Decimal Number | 57223 | 0.3% |
| Dash Punctuation | 39849 | 0.2% |
| Final Punctuation | 3552 | < 0.1% |
| Open Punctuation | 1418 | < 0.1% |
| Close Punctuation | 1382 | < 0.1% |
| Modifier Symbol | 850 | < 0.1% |
| Other values (14) | 2013 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1949331 | |
| e | 1614156 | |
| r | 1122456 | |
| i | 1111677 | |
| o | 971826 | 7.5% |
| t | 900478 | 6.9% |
| n | 874322 | 6.7% |
| s | 784395 | 6.0% |
| l | 641495 | 4.9% |
| u | 508203 | 3.9% |
| Other values (166) | 2525539 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 282760 | 10.4% |
| B | 238570 | 8.8% |
| P | 226554 | 8.3% |
| R | 218269 | 8.0% |
| L | 215980 | 7.9% |
| S | 179794 | 6.6% |
| T | 172625 | 6.3% |
| A | 146714 | 5.4% |
| M | 145033 | 5.3% |
| D | 115710 | 4.2% |
| Other values (126) | 781834 |
Other Letter
| Value | Count | Frequency (%) |
| º | 35 | 18.4% |
| ª | 14 | 7.4% |
| ا | 7 | 3.7% |
| ل | 5 | 2.6% |
| 酒 | 5 | 2.6% |
| ن | 4 | 2.1% |
| 中 | 3 | 1.6% |
| 楼 | 3 | 1.6% |
| ه | 3 | 1.6% |
| ب | 3 | 1.6% |
| Other values (89) | 108 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 85471 | |
| & | 44219 | |
| . | 14769 | 9.4% |
| , | 4632 | 3.0% |
| " | 3626 | 2.3% |
| ! | 1282 | 0.8% |
| / | 1126 | 0.7% |
| @ | 467 | 0.3% |
| : | 287 | 0.2% |
| # | 260 | 0.2% |
| Other values (12) | 508 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 11168 | |
| 2 | 9127 | |
| 0 | 7520 | |
| 3 | 5656 | |
| 4 | 4370 | 7.6% |
| 9 | 4361 | 7.6% |
| 5 | 4122 | 7.2% |
| 8 | 3853 | 6.7% |
| 7 | 3524 | 6.2% |
| 6 | 3522 | 6.2% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 141 | |
| ̈ | 133 | |
| ̀ | 41 | 12.2% |
| ̌ | 7 | 2.1% |
| ̃ | 6 | 1.8% |
| ̂ | 4 | 1.2% |
| ِ | 1 | 0.3% |
| ् | 1 | 0.3% |
| ̊ | 1 | 0.3% |
| ̧ | 1 | 0.3% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 105 | |
| ® | 41 | 26.3% |
| № | 2 | 1.3% |
| © | 2 | 1.3% |
| ¦ | 1 | 0.6% |
| 🌏 | 1 | 0.6% |
| 🌍 | 1 | 0.6% |
| 🌎 | 1 | 0.6% |
| 🍅 | 1 | 0.6% |
| ♥ | 1 | 0.6% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 582 | |
| | | 156 | 20.4% |
| ~ | 14 | 1.8% |
| = | 6 | 0.8% |
| ∙ | 3 | 0.4% |
| > | 3 | 0.4% |
| ∞ | 1 | 0.1% |
| ⎜ | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1741579 | ||
| 110 | < 0.1% | |
| 1 | < 0.1% | |
| 1 | < 0.1% | |
| 1 | < 0.1% | |
| 1 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 537 | |
| ` | 301 | |
| ¨ | 7 | 0.8% |
| ˝ | 2 | 0.2% |
| ^ | 2 | 0.2% |
| ΄ | 1 | 0.1% |
Format
| Value | Count | Frequency (%) |
| | 22 | |
| | 18 | |
| | 7 | 13.2% |
| | 3 | 5.7% |
| | 2 | 3.8% |
| | 1 | 1.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1357 | |
| „ | 32 | 2.3% |
| [ | 27 | 1.9% |
| { | 1 | 0.1% |
| ( | 1 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 39692 | |
| – | 153 | 0.4% |
| — | 3 | < 0.1% |
| ‐ | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1354 | |
| ] | 26 | 1.9% |
| } | 1 | 0.1% |
| ) | 1 | 0.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ˋ | 2 | |
| ᵒ | 1 | |
| ᶜ | 1 | |
| ʾ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 3469 | |
| ” | 69 | 1.9% |
| » | 14 | 0.4% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 207 | |
| “ | 96 | |
| « | 14 | 4.4% |
Other Number
| Value | Count | Frequency (%) |
| ² | 20 | |
| ¾ | 2 | 8.7% |
| ³ | 1 | 4.3% |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 5 | |
| $ | 4 | |
| £ | 2 | 18.2% |
Control
| Value | Count | Frequency (%) |
| | 1 | |
| 1 | ||
| | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 149 |
Spacing Mark
| Value | Count | Frequency (%) |
| ा | 2 |
Letter Number
| Value | Count | Frequency (%) |
| Ⅱ | 1 |
Line Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15724854 | |
| Common | 2004095 | 11.3% |
| Greek | 2570 | < 0.1% |
| Cyrillic | 350 | < 0.1% |
| Inherited | 335 | < 0.1% |
| Han | 83 | < 0.1% |
| Arabic | 42 | < 0.1% |
| Hebrew | 7 | < 0.1% |
| Devanagari | 6 | < 0.1% |
| Katakana | 6 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1949331 | 12.4% |
| e | 1614156 | 10.3% |
| r | 1122456 | 7.1% |
| i | 1111677 | 7.1% |
| o | 971826 | 6.2% |
| t | 900478 | 5.7% |
| n | 874322 | 5.6% |
| s | 784395 | 5.0% |
| l | 641495 | 4.1% |
| u | 508203 | 3.2% |
| Other values (199) | 5246515 |
Common
| Value | Count | Frequency (%) |
| 1741579 | ||
| ' | 85471 | 4.3% |
| & | 44219 | 2.2% |
| - | 39692 | 2.0% |
| . | 14769 | 0.7% |
| 1 | 11168 | 0.6% |
| 2 | 9127 | 0.5% |
| 0 | 7520 | 0.4% |
| 3 | 5656 | 0.3% |
| , | 4632 | 0.2% |
| Other values (89) | 40262 | 2.0% |
Han
| Value | Count | Frequency (%) |
| 酒 | 5 | 6.0% |
| 中 | 3 | 3.6% |
| 楼 | 3 | 3.6% |
| 香 | 3 | 3.6% |
| 菜 | 2 | 2.4% |
| 川 | 2 | 2.4% |
| 意 | 2 | 2.4% |
| 港 | 2 | 2.4% |
| 记 | 2 | 2.4% |
| 园 | 2 | 2.4% |
| Other values (53) | 57 |
Greek
| Value | Count | Frequency (%) |
| α | 295 | 11.5% |
| ο | 232 | 9.0% |
| ι | 148 | 5.8% |
| ν | 131 | 5.1% |
| ρ | 128 | 5.0% |
| τ | 113 | 4.4% |
| ε | 98 | 3.8% |
| Τ | 93 | 3.6% |
| κ | 86 | 3.3% |
| λ | 84 | 3.3% |
| Other values (50) | 1162 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 57 | |
| н | 29 | 8.3% |
| т | 28 | 8.0% |
| р | 26 | 7.4% |
| е | 24 | 6.9% |
| и | 21 | 6.0% |
| о | 19 | 5.4% |
| с | 15 | 4.3% |
| к | 12 | 3.4% |
| л | 10 | 2.9% |
| Other values (39) | 109 |
Arabic
| Value | Count | Frequency (%) |
| ا | 7 | |
| ل | 5 | |
| ن | 4 | |
| ه | 3 | 7.1% |
| ب | 3 | 7.1% |
| ي | 3 | 7.1% |
| م | 2 | 4.8% |
| د | 2 | 4.8% |
| ر | 2 | 4.8% |
| ک | 2 | 4.8% |
| Other values (9) | 9 |
Inherited
| Value | Count | Frequency (%) |
| ́ | 141 | |
| ̈ | 133 | |
| ̀ | 41 | 12.2% |
| ̌ | 7 | 2.1% |
| ̃ | 6 | 1.8% |
| ̂ | 4 | 1.2% |
| ِ | 1 | 0.3% |
| ̊ | 1 | 0.3% |
| ̧ | 1 | 0.3% |
Hebrew
| Value | Count | Frequency (%) |
| ק | 2 | |
| ם | 1 | |
| ר | 1 | |
| ט | 1 | |
| ש | 1 | |
| כ | 1 |
Katakana
| Value | Count | Frequency (%) |
| ヤ | 1 | |
| イ | 1 | |
| ダ | 1 | |
| ド | 1 | |
| ン | 1 | |
| モ | 1 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 2 | |
| य | 1 | |
| त | 1 | |
| ् | 1 | |
| र | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17659195 | |
| None | 68133 | 0.4% |
| Punctuation | 4150 | < 0.1% |
| Cyrillic | 350 | < 0.1% |
| Diacriticals | 334 | < 0.1% |
| CJK | 83 | < 0.1% |
| Arabic | 43 | < 0.1% |
| Latin Ext Additional | 20 | < 0.1% |
| Hebrew | 7 | < 0.1% |
| Devanagari | 6 | < 0.1% |
| Other values (10) | 27 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1949331 | 11.0% |
| 1741579 | 9.9% | |
| e | 1614156 | 9.1% |
| r | 1122456 | 6.4% |
| i | 1111677 | 6.3% |
| o | 971826 | 5.5% |
| t | 900478 | 5.1% |
| n | 874322 | 5.0% |
| s | 784395 | 4.4% |
| l | 641495 | 3.6% |
| Other values (85) | 5947480 |
None
| Value | Count | Frequency (%) |
| é | 17796 | |
| í | 5674 | 8.3% |
| è | 5598 | 8.2% |
| ó | 3521 | 5.2% |
| á | 3486 | 5.1% |
| ü | 3171 | 4.7% |
| ä | 3157 | 4.6% |
| ö | 2963 | 4.3% |
| à | 1926 | 2.8% |
| ñ | 1416 | 2.1% |
| Other values (222) | 19425 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 3469 | |
| ‘ | 207 | 5.0% |
| – | 153 | 3.7% |
| “ | 96 | 2.3% |
| • | 71 | 1.7% |
| ” | 69 | 1.7% |
| „ | 32 | 0.8% |
| | 22 | 0.5% |
| | 18 | 0.4% |
| — | 3 | 0.1% |
| Other values (8) | 10 | 0.2% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 141 | |
| ̈ | 133 | |
| ̀ | 41 | 12.3% |
| ̌ | 7 | 2.1% |
| ̃ | 6 | 1.8% |
| ̂ | 4 | 1.2% |
| ̊ | 1 | 0.3% |
| ̧ | 1 | 0.3% |
Cyrillic
| Value | Count | Frequency (%) |
| а | 57 | |
| н | 29 | 8.3% |
| т | 28 | 8.0% |
| р | 26 | 7.4% |
| е | 24 | 6.9% |
| и | 21 | 6.0% |
| о | 19 | 5.4% |
| с | 15 | 4.3% |
| к | 12 | 3.4% |
| л | 10 | 2.9% |
| Other values (39) | 109 |
Arabic
| Value | Count | Frequency (%) |
| ا | 7 | |
| ل | 5 | |
| ن | 4 | 9.3% |
| ه | 3 | 7.0% |
| ب | 3 | 7.0% |
| ي | 3 | 7.0% |
| م | 2 | 4.7% |
| د | 2 | 4.7% |
| ر | 2 | 4.7% |
| ک | 2 | 4.7% |
| Other values (10) | 10 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ệ | 5 | |
| ở | 5 | |
| ộ | 2 | 10.0% |
| Ế | 1 | 5.0% |
| ế | 1 | 5.0% |
| ḗ | 1 | 5.0% |
| Ứ | 1 | 5.0% |
| ạ | 1 | 5.0% |
| ḯ | 1 | 5.0% |
| ṓ | 1 | 5.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 5 |
CJK
| Value | Count | Frequency (%) |
| 酒 | 5 | 6.0% |
| 中 | 3 | 3.6% |
| 楼 | 3 | 3.6% |
| 香 | 3 | 3.6% |
| 菜 | 2 | 2.4% |
| 川 | 2 | 2.4% |
| 意 | 2 | 2.4% |
| 港 | 2 | 2.4% |
| 记 | 2 | 2.4% |
| 园 | 2 | 2.4% |
| Other values (53) | 57 |
Math Operators
| Value | Count | Frequency (%) |
| ∙ | 3 | |
| ∞ | 1 | 25.0% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 2 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˝ | 2 | |
| ˋ | 2 | |
| ʾ | 1 |
Hebrew
| Value | Count | Frequency (%) |
| ק | 2 | |
| ם | 1 | |
| ר | 1 | |
| ט | 1 | |
| ש | 1 | |
| כ | 1 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 2 | |
| य | 1 | |
| त | 1 | |
| ् | 1 | |
| र | 1 |
Number Forms
| Value | Count | Frequency (%) |
| Ⅱ | 1 |
Katakana
| Value | Count | Frequency (%) |
| ヤ | 1 | |
| イ | 1 | |
| ダ | 1 | |
| ド | 1 | |
| ン | 1 | |
| モ | 1 |
Phonetic Ext
| Value | Count | Frequency (%) |
| ᵒ | 1 |
Misc Technical
| Value | Count | Frequency (%) |
| ⎜ | 1 |
Phonetic Ext Sup
| Value | Count | Frequency (%) |
| ᶜ | 1 |
Misc Symbols
| Value | Count | Frequency (%) |
| ♥ | 1 |
| Distinct | 65997 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 127.9 MiB |
| ["Europe", "United Kingdom (UK)", "England", "London"] | 22942 |
|---|---|
| ["Europe", "France", "Ile-de-France", "Paris"] | 18129 |
| ["Europe", "Italy", "Lazio", "Rome"] | 12603 |
| ["Europe", "Spain", "Community of Madrid", "Madrid"] | 12134 |
| ["Europe", "Spain", "Catalonia", "Province of Barcelona", "Barcelona"] | 10285 |
| Other values (65992) |
Length
| Max length | 151 |
|---|---|
| Median length | 67 |
| Mean length | 66.78260601 |
| Min length | 19 |
Characters and Unicode
| Total characters | 72352075 |
|---|---|
| Distinct characters | 71 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 21413 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | ["Europe", "France", "Nouvelle-Aquitaine", "Haute-Vienne", "Saint-Jouvent"] |
|---|---|
| 2nd row | ["Europe", "France", "Nouvelle-Aquitaine", "Haute-Vienne", "Saint-Jouvent"] |
| 3rd row | ["Europe", "France", "Centre-Val de Loire", "Berry", "Indre", "Rivarennes"] |
| 4th row | ["Europe", "France", "Nouvelle-Aquitaine", "Correze", "Lacelle"] |
| 5th row | ["Europe", "France", "Occitanie", "Aveyron", "Saint-Laurent-de-Levezou"] |
Common Values
| Value | Count | Frequency (%) |
| ["Europe", "United Kingdom (UK)", "England", "London"] | 22942 | 2.1% |
| ["Europe", "France", "Ile-de-France", "Paris"] | 18129 | 1.7% |
| ["Europe", "Italy", "Lazio", "Rome"] | 12603 | 1.2% |
| ["Europe", "Spain", "Community of Madrid", "Madrid"] | 12134 | 1.1% |
| ["Europe", "Spain", "Catalonia", "Province of Barcelona", "Barcelona"] | 10285 | 0.9% |
| ["Europe", "Italy", "Lombardy", "Milan"] | 8382 | 0.8% |
| ["Europe", "Germany", "Berlin"] | 7217 | 0.7% |
| ["Europe", "Czech Republic", "Bohemia", "Prague"] | 6035 | 0.6% |
| ["Europe", "Portugal", "Central Portugal", "Lisbon District", "Lisbon"] | 5261 | 0.5% |
| ["Europe", "Austria", "Vienna Region", "Vienna"] | 4571 | 0.4% |
| Other values (65987) | 975838 |
Length
| Value | Count | Frequency (%) |
| europe | 1083397 | 14.7% |
| province | 381688 | 5.2% |
| of | 336316 | 4.6% |
| italy | 224763 | 3.0% |
| kingdom | 171664 | 2.3% |
| uk | 171664 | 2.3% |
| united | 171664 | 2.3% |
| spain | 157486 | 2.1% |
| france | 155288 | 2.1% |
| england | 144681 | 2.0% |
| Other values (61028) | 4384880 |
Most occurring characters
| Value | Count | Frequency (%) |
| " | 10516036 | |
| 6300149 | 8.7% | |
| e | 5091386 | 7.0% |
| a | 4329028 | 6.0% |
| , | 4174631 | 5.8% |
| o | 4039912 | 5.6% |
| r | 4013033 | 5.5% |
| n | 3815234 | 5.3% |
| i | 2982907 | 4.1% |
| u | 2053406 | 2.8% |
| Other values (61) | 25036353 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40851093 | |
| Other Punctuation | 14739954 | 20.4% |
| Uppercase Letter | 7462877 | 10.3% |
| Space Separator | 6300149 | 8.7% |
| Close Punctuation | 1256228 | 1.7% |
| Open Punctuation | 1256228 | 1.7% |
| Dash Punctuation | 484731 | 0.7% |
| Decimal Number | 815 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5091386 | |
| a | 4329028 | |
| o | 4039912 | |
| r | 4013033 | |
| n | 3815234 | |
| i | 2982907 | 7.3% |
| u | 2053406 | 5.0% |
| l | 2019889 | 4.9% |
| t | 1928588 | 4.7% |
| p | 1500452 | 3.7% |
| Other values (16) | 9077258 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1317168 | |
| P | 737802 | 9.9% |
| S | 527168 | 7.1% |
| C | 515208 | 6.9% |
| K | 388069 | 5.2% |
| A | 384463 | 5.2% |
| U | 378131 | 5.1% |
| B | 348781 | 4.7% |
| I | 347383 | 4.7% |
| F | 320957 | 4.3% |
| Other values (16) | 2197747 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 307 | |
| 2 | 221 | |
| 1 | 144 | |
| 3 | 99 | 12.1% |
| 9 | 42 | 5.2% |
| 7 | 1 | 0.1% |
| 4 | 1 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 10516036 | |
| , | 4174631 | 28.3% |
| ' | 48258 | 0.3% |
| . | 606 | < 0.1% |
| \ | 422 | < 0.1% |
| & | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1083397 | |
| ) | 172831 | 13.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1083397 | |
| ( | 172831 | 13.8% |
Space Separator
| Value | Count | Frequency (%) |
| 6300149 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 484731 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48313970 | |
| Common | 24038105 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5091386 | 10.5% |
| a | 4329028 | 9.0% |
| o | 4039912 | 8.4% |
| r | 4013033 | 8.3% |
| n | 3815234 | 7.9% |
| i | 2982907 | 6.2% |
| u | 2053406 | 4.3% |
| l | 2019889 | 4.2% |
| t | 1928588 | 4.0% |
| p | 1500452 | 3.1% |
| Other values (42) | 16540135 |
Common
| Value | Count | Frequency (%) |
| " | 10516036 | |
| 6300149 | ||
| , | 4174631 | 17.4% |
| ] | 1083397 | 4.5% |
| [ | 1083397 | 4.5% |
| - | 484731 | 2.0% |
| ) | 172831 | 0.7% |
| ( | 172831 | 0.7% |
| ' | 48258 | 0.2% |
| . | 606 | < 0.1% |
| Other values (9) | 1238 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 72352075 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| " | 10516036 | |
| 6300149 | 8.7% | |
| e | 5091386 | 7.0% |
| a | 4329028 | 6.0% |
| , | 4174631 | 5.8% |
| o | 4039912 | 5.6% |
| r | 4013033 | 5.5% |
| n | 3815234 | 5.3% |
| i | 2982907 | 4.1% |
| u | 2053406 | 2.8% |
| Other values (61) | 25036353 |
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 MiB |
| Italy | |
|---|---|
| Spain | |
| France | |
| England | |
| Germany | |
| Other values (19) |
Length
| Max length | 16 |
|---|---|
| Median length | 6 |
| Mean length | 6.460420326 |
| Min length | 5 |
Characters and Unicode
| Total characters | 6999200 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | France |
|---|---|
| 2nd row | France |
| 3rd row | France |
| 4th row | France |
| 5th row | France |
Common Values
| Value | Count | Frequency (%) |
| Italy | 224763 | |
| Spain | 157479 | |
| France | 155288 | |
| England | 144681 | |
| Germany | 115333 | |
| Greece | 33763 | 3.1% |
| Portugal | 32592 | 3.0% |
| The Netherlands | 29792 | 2.7% |
| Poland | 24698 | 2.3% |
| Belgium | 23711 | 2.2% |
| Other values (14) | 141297 |
Length
| Value | Count | Frequency (%) |
| italy | 224763 | |
| spain | 157479 | |
| france | 155288 | |
| england | 144681 | |
| germany | 115333 | |
| greece | 33763 | 3.0% |
| portugal | 32592 | 2.9% |
| the | 29792 | 2.6% |
| netherlands | 29792 | 2.6% |
| poland | 24698 | 2.2% |
| Other values (16) | 183486 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1017461 | |
| n | 862695 | |
| e | 588885 | 8.4% |
| l | 549359 | 7.8% |
| r | 439120 | 6.3% |
| y | 347527 | 5.0% |
| t | 333858 | 4.8% |
| d | 254150 | 3.6% |
| i | 248830 | 3.6% |
| I | 239600 | 3.4% |
| Other values (28) | 2117715 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5819263 | |
| Uppercase Letter | 1131667 | 16.2% |
| Space Separator | 48270 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1017461 | |
| n | 862695 | |
| e | 588885 | |
| l | 549359 | |
| r | 439120 | |
| y | 347527 | 6.0% |
| t | 333858 | 5.7% |
| d | 254150 | 4.4% |
| i | 248830 | 4.3% |
| c | 232954 | 4.0% |
| Other values (12) | 944424 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 239600 | |
| S | 194500 | |
| F | 162660 | |
| G | 149096 | |
| E | 144681 | |
| P | 57290 | 5.1% |
| N | 33426 | 3.0% |
| T | 29792 | 2.6% |
| B | 28180 | 2.5% |
| C | 23219 | 2.1% |
| Other values (5) | 69223 | 6.1% |
Space Separator
| Value | Count | Frequency (%) |
| 48270 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6950930 | |
| Common | 48270 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1017461 | |
| n | 862695 | |
| e | 588885 | 8.5% |
| l | 549359 | 7.9% |
| r | 439120 | 6.3% |
| y | 347527 | 5.0% |
| t | 333858 | 4.8% |
| d | 254150 | 3.7% |
| i | 248830 | 3.6% |
| I | 239600 | 3.4% |
| Other values (27) | 2069445 |
Common
| Value | Count | Frequency (%) |
| 48270 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6999200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1017461 | |
| n | 862695 | |
| e | 588885 | 8.4% |
| l | 549359 | 7.8% |
| r | 439120 | 6.3% |
| y | 347527 | 5.0% |
| t | 333858 | 4.8% |
| d | 254150 | 3.6% |
| i | 248830 | 3.6% |
| I | 239600 | 3.4% |
| Other values (28) | 2117715 |
| Distinct | 250 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 50323 |
| Missing (%) | 4.6% |
| Memory size | 69.1 MiB |
| Lombardy | 33097 |
|---|---|
| Ile-de-France | 31271 |
| Andalucia | 29562 |
| Catalonia | 28569 |
| Lazio | 23831 |
| Other values (245) |
Length
| Max length | 28 |
|---|---|
| Median length | 9 |
| Mean length | 11.56804837 |
| Min length | 4 |
Characters and Unicode
| Total characters | 11950650 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Nouvelle-Aquitaine |
|---|---|
| 2nd row | Nouvelle-Aquitaine |
| 3rd row | Centre-Val de Loire |
| 4th row | Nouvelle-Aquitaine |
| 5th row | Occitanie |
Common Values
| Value | Count | Frequency (%) |
| Lombardy | 33097 | 3.1% |
| Ile-de-France | 31271 | 2.9% |
| Andalucia | 29562 | 2.7% |
| Catalonia | 28569 | 2.6% |
| Lazio | 23831 | 2.2% |
| London | 22942 | 2.1% |
| Bavaria | 21531 | 2.0% |
| North Rhine-Westphalia | 21116 | 1.9% |
| Auvergne-Rhone-Alpes | 20753 | 1.9% |
| Provence-Alpes-Cote d'Azur | 19925 | 1.8% |
| Other values (240) | 780477 | |
| (Missing) | 50323 | 4.6% |
Length
| Value | Count | Frequency (%) |
| of | 33839 | 2.4% |
| lombardy | 33097 | 2.3% |
| central | 32835 | 2.3% |
| ile-de-france | 31271 | 2.2% |
| andalucia | 29562 | 2.1% |
| london | 29388 | 2.0% |
| catalonia | 28569 | 2.0% |
| islands | 28244 | 2.0% |
| poland | 24698 | 1.7% |
| north | 24377 | 1.7% |
| Other values (262) | 1138158 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1296788 | 10.9% |
| e | 1104772 | 9.2% |
| n | 935839 | 7.8% |
| r | 806044 | 6.7% |
| i | 784125 | 6.6% |
| o | 698710 | 5.8% |
| t | 593900 | 5.0% |
| l | 567339 | 4.7% |
| s | 461365 | 3.9% |
| d | 408191 | 3.4% |
| Other values (43) | 4293577 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9656415 | |
| Uppercase Letter | 1601456 | 13.4% |
| Space Separator | 400964 | 3.4% |
| Dash Punctuation | 270679 | 2.3% |
| Other Punctuation | 21136 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1296788 | |
| e | 1104772 | |
| n | 935839 | |
| r | 806044 | |
| i | 784125 | 8.1% |
| o | 698710 | 7.2% |
| t | 593900 | 6.2% |
| l | 567339 | 5.9% |
| s | 461365 | 4.8% |
| d | 408191 | 4.2% |
| Other values (16) | 1999342 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 207371 | |
| A | 205346 | |
| L | 145737 | 9.1% |
| P | 128733 | 8.0% |
| B | 106354 | 6.6% |
| S | 90956 | 5.7% |
| R | 83882 | 5.2% |
| W | 76469 | 4.8% |
| N | 70134 | 4.4% |
| M | 65597 | 4.1% |
| Other values (14) | 420877 |
Space Separator
| Value | Count | Frequency (%) |
| 400964 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 270679 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 21136 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11257871 | |
| Common | 692779 | 5.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1296788 | 11.5% |
| e | 1104772 | 9.8% |
| n | 935839 | 8.3% |
| r | 806044 | 7.2% |
| i | 784125 | 7.0% |
| o | 698710 | 6.2% |
| t | 593900 | 5.3% |
| l | 567339 | 5.0% |
| s | 461365 | 4.1% |
| d | 408191 | 3.6% |
| Other values (40) | 3600798 |
Common
| Value | Count | Frequency (%) |
| 400964 | ||
| - | 270679 | |
| ' | 21136 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11950650 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1296788 | 10.9% |
| e | 1104772 | 9.2% |
| n | 935839 | 7.8% |
| r | 806044 | 6.7% |
| i | 784125 | 6.6% |
| o | 698710 | 5.8% |
| t | 593900 | 5.0% |
| l | 567339 | 4.7% |
| s | 461365 | 3.9% |
| d | 408191 | 3.4% |
| Other values (43) | 4293577 |
| Distinct | 1333 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 340632 |
| Missing (%) | 31.4% |
| Memory size | 62.0 MiB |
| Province of Barcelona | 18952 |
|---|---|
| Province of Malaga | 10056 |
| Province of Alicante | 9137 |
| Province of Naples | 8962 |
| Upper Bavaria | 8584 |
| Other values (1328) |
Length
| Max length | 43 |
|---|---|
| Median length | 17 |
| Mean length | 15.90769759 |
| Min length | 3 |
Characters and Unicode
| Total characters | 11815681 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 125 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Haute-Vienne |
|---|---|
| 2nd row | Haute-Vienne |
| 3rd row | Berry |
| 4th row | Correze |
| 5th row | Aveyron |
Common Values
| Value | Count | Frequency (%) |
| Province of Barcelona | 18952 | 1.7% |
| Province of Malaga | 10056 | 0.9% |
| Province of Alicante | 9137 | 0.8% |
| Province of Naples | 8962 | 0.8% |
| Upper Bavaria | 8584 | 0.8% |
| Lisbon District | 8223 | 0.8% |
| French Riviera - Cote d'Azur | 8156 | 0.8% |
| Province of Turin | 7846 | 0.7% |
| North Holland Province | 7620 | 0.7% |
| Province of Valencia | 7337 | 0.7% |
| Other values (1323) | 647892 | |
| (Missing) | 340632 |
Length
| Value | Count | Frequency (%) |
| province | 371190 | |
| of | 299513 | 17.5% |
| county | 49203 | 2.9% |
| district | 33745 | 2.0% |
| region | 24735 | 1.4% |
| barcelona | 18952 | 1.1% |
| north | 16813 | 1.0% |
| south | 16341 | 1.0% |
| riviera | 15840 | 0.9% |
| yorkshire | 13672 | 0.8% |
| Other values (1463) | 848979 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1219746 | 10.3% |
| e | 1089141 | 9.2% |
| 966218 | 8.2% | |
| r | 938288 | 7.9% |
| i | 916432 | 7.8% |
| n | 909976 | 7.7% |
| a | 830518 | 7.0% |
| c | 550953 | 4.7% |
| v | 453260 | 3.8% |
| P | 435671 | 3.7% |
| Other values (47) | 3505478 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9254671 | |
| Uppercase Letter | 1466596 | 12.4% |
| Space Separator | 966218 | 8.2% |
| Dash Punctuation | 114832 | 1.0% |
| Other Punctuation | 13360 | 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1219746 | |
| e | 1089141 | |
| r | 938288 | |
| i | 916432 | |
| n | 909976 | |
| a | 830518 | |
| c | 550953 | 6.0% |
| v | 453260 | 4.9% |
| t | 392009 | 4.2% |
| f | 321533 | 3.5% |
| Other values (16) | 1632815 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 435671 | |
| C | 130457 | 8.9% |
| S | 88680 | 6.0% |
| B | 83698 | 5.7% |
| R | 75606 | 5.2% |
| L | 71353 | 4.9% |
| M | 64552 | 4.4% |
| A | 62645 | 4.3% |
| D | 54253 | 3.7% |
| V | 51440 | 3.5% |
| Other values (16) | 348241 |
Space Separator
| Value | Count | Frequency (%) |
| 966218 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 114832 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 13360 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10721267 | |
| Common | 1094414 | 9.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1219746 | |
| e | 1089141 | 10.2% |
| r | 938288 | 8.8% |
| i | 916432 | 8.5% |
| n | 909976 | 8.5% |
| a | 830518 | 7.7% |
| c | 550953 | 5.1% |
| v | 453260 | 4.2% |
| P | 435671 | 4.1% |
| t | 392009 | 3.7% |
| Other values (42) | 2985273 |
Common
| Value | Count | Frequency (%) |
| 966218 | ||
| - | 114832 | 10.5% |
| ' | 13360 | 1.2% |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11815681 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1219746 | 10.3% |
| e | 1089141 | 9.2% |
| 966218 | 8.2% | |
| r | 938288 | 7.9% |
| i | 916432 | 7.8% |
| n | 909976 | 7.7% |
| a | 830518 | 7.0% |
| c | 550953 | 4.7% |
| v | 453260 | 3.8% |
| P | 435671 | 3.7% |
| Other values (47) | 3505478 |
| Distinct | 43495 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 400685 |
| Missing (%) | 37.0% |
| Memory size | 55.2 MiB |
| Paris | 18129 |
|---|---|
| Rome | 12603 |
| Madrid | 12134 |
| Milan | 8382 |
| Prague | 6035 |
| Other values (43490) |
Length
| Max length | 48 |
|---|---|
| Median length | 8 |
| Mean length | 9.03642971 |
| Min length | 2 |
Characters and Unicode
| Total characters | 6169279 |
|---|---|
| Distinct characters | 66 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 14579 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | Saint-Jouvent |
|---|---|
| 2nd row | Saint-Jouvent |
| 3rd row | Rivarennes |
| 4th row | Lacelle |
| 5th row | Saint-Laurent-de-Levezou |
Common Values
| Value | Count | Frequency (%) |
| Paris | 18129 | 1.7% |
| Rome | 12603 | 1.2% |
| Madrid | 12134 | 1.1% |
| Milan | 8382 | 0.8% |
| Prague | 6035 | 0.6% |
| Lisbon | 5261 | 0.5% |
| Vienna | 4571 | 0.4% |
| Amsterdam | 4352 | 0.4% |
| Budapest | 3557 | 0.3% |
| Munich | 3508 | 0.3% |
| Other values (43485) | 604180 | |
| (Missing) | 400685 |
Length
| Value | Count | Frequency (%) |
| paris | 18129 | 2.2% |
| de | 15706 | 1.9% |
| rome | 12603 | 1.5% |
| madrid | 12156 | 1.5% |
| milan | 8382 | 1.0% |
| la | 7449 | 0.9% |
| prague | 6035 | 0.7% |
| lisbon | 5266 | 0.6% |
| vienna | 4571 | 0.6% |
| amsterdam | 4352 | 0.5% |
| Other values (42902) | 733574 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 648027 | 10.5% |
| a | 564130 | 9.1% |
| r | 447685 | 7.3% |
| n | 438152 | 7.1% |
| o | 387285 | 6.3% |
| i | 375702 | 6.1% |
| l | 306983 | 5.0% |
| s | 297756 | 4.8% |
| t | 260206 | 4.2% |
| u | 204480 | 3.3% |
| Other values (56) | 2238873 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5079293 | |
| Uppercase Letter | 850304 | 13.8% |
| Space Separator | 145515 | 2.4% |
| Dash Punctuation | 86088 | 1.4% |
| Other Punctuation | 5413 | 0.1% |
| Open Punctuation | 1029 | < 0.1% |
| Close Punctuation | 1029 | < 0.1% |
| Decimal Number | 608 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 648027 | |
| a | 564130 | |
| r | 447685 | 8.8% |
| n | 438152 | 8.6% |
| o | 387285 | 7.6% |
| i | 375702 | 7.4% |
| l | 306983 | 6.0% |
| s | 297756 | 5.9% |
| t | 260206 | 5.1% |
| u | 204480 | 4.0% |
| Other values (16) | 1148887 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 83041 | 9.8% |
| M | 81543 | 9.6% |
| P | 72389 | 8.5% |
| B | 71112 | 8.4% |
| L | 66710 | 7.8% |
| C | 64687 | 7.6% |
| A | 52999 | 6.2% |
| R | 42362 | 5.0% |
| T | 34598 | 4.1% |
| H | 33679 | 4.0% |
| Other values (16) | 247184 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 4677 | |
| . | 582 | 10.8% |
| \ | 150 | 2.8% |
| , | 3 | 0.1% |
| & | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 198 | |
| 2 | 158 | |
| 1 | 126 | |
| 3 | 95 | |
| 9 | 31 | 5.1% |
Space Separator
| Value | Count | Frequency (%) |
| 145515 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 86088 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1029 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1029 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5929597 | |
| Common | 239682 | 3.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 648027 | 10.9% |
| a | 564130 | 9.5% |
| r | 447685 | 7.6% |
| n | 438152 | 7.4% |
| o | 387285 | 6.5% |
| i | 375702 | 6.3% |
| l | 306983 | 5.2% |
| s | 297756 | 5.0% |
| t | 260206 | 4.4% |
| u | 204480 | 3.4% |
| Other values (42) | 1999191 |
Common
| Value | Count | Frequency (%) |
| 145515 | ||
| - | 86088 | |
| ' | 4677 | 2.0% |
| ( | 1029 | 0.4% |
| ) | 1029 | 0.4% |
| . | 582 | 0.2% |
| 0 | 198 | 0.1% |
| 2 | 158 | 0.1% |
| \ | 150 | 0.1% |
| 1 | 126 | 0.1% |
| Other values (4) | 130 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6169279 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 648027 | 10.5% |
| a | 564130 | 9.1% |
| r | 447685 | 7.3% |
| n | 438152 | 7.1% |
| o | 387285 | 6.3% |
| i | 375702 | 6.1% |
| l | 306983 | 5.0% |
| s | 297756 | 4.8% |
| t | 260206 | 4.2% |
| u | 204480 | 3.3% |
| Other values (56) | 2238873 |
| Distinct | 1034685 |
|---|---|
| Distinct (%) | 95.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 112.5 MiB |
| Greece | 92 |
|---|---|
| Tsilivi (Planos) 29100 Greece | 29 |
| Brasschaat 2930 Belgium | 24 |
| Fira 84700 Greece | 23 |
| Sidari 49081 Greece | 23 |
| Other values (1034680) |
Length
| Max length | 382 |
|---|---|
| Median length | 47 |
| Mean length | 49.73795755 |
| Min length | 5 |
Characters and Unicode
| Total characters | 53885954 |
|---|---|
| Distinct characters | 387 |
| Distinct categories | 23 ? |
| Distinct scripts | 7 ? |
| Distinct blocks | 13 ? |
Unique
| Unique | 995486 ? |
|---|---|
| Unique (%) | 91.9% |
Sample
| 1st row | 10 Maison Neuve, 87510 Saint-Jouvent France |
|---|---|
| 2nd row | 16 Place de l Eglise, 87510 Saint-Jouvent France |
| 3rd row | 2 rue des Dames, 36800 Rivarennes France |
| 4th row | 9 avenue Porte de la Correze 19170, 19170 Lacelle France |
| 5th row | route du Montseigne, 12620 Saint-Laurent-de-Levezou France |
Common Values
| Value | Count | Frequency (%) |
| Greece | 92 | < 0.1% |
| Tsilivi (Planos) 29100 Greece | 29 | < 0.1% |
| Brasschaat 2930 Belgium | 24 | < 0.1% |
| Fira 84700 Greece | 23 | < 0.1% |
| Sidari 49081 Greece | 23 | < 0.1% |
| Lindos 85107 Greece | 22 | < 0.1% |
| Skala 85500 Greece | 21 | < 0.1% |
| London England | 19 | < 0.1% |
| Pefkohori 63085 Greece | 19 | < 0.1% |
| Oia 84702 Greece | 19 | < 0.1% |
| Other values (1034675) | 1083106 |
Length
| Value | Count | Frequency (%) |
| italy | 224837 | 2.7% |
| spain | 157642 | 1.9% |
| de | 156353 | 1.9% |
| france | 155984 | 1.9% |
| via | 154056 | 1.9% |
| england | 144755 | 1.8% |
| germany | 115368 | 1.4% |
| rue | 77452 | 0.9% |
| calle | 63329 | 0.8% |
| road | 62905 | 0.8% |
| Other values (390553) | 6942449 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7172662 | 13.3% | |
| a | 4595436 | 8.5% |
| e | 3928085 | 7.3% |
| n | 2781382 | 5.2% |
| r | 2730516 | 5.1% |
| i | 2325435 | 4.3% |
| l | 2225854 | 4.1% |
| o | 2148462 | 4.0% |
| t | 1877320 | 3.5% |
| , | 1535166 | 2.8% |
| Other values (377) | 22565636 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31694793 | |
| Space Separator | 7172694 | 13.3% |
| Decimal Number | 6591218 | 12.2% |
| Uppercase Letter | 6389813 | 11.9% |
| Other Punctuation | 1754765 | 3.3% |
| Dash Punctuation | 272139 | 0.5% |
| Other Letter | 3230 | < 0.1% |
| Close Punctuation | 1827 | < 0.1% |
| Open Punctuation | 1819 | < 0.1% |
| Other Symbol | 1353 | < 0.1% |
| Other values (13) | 2303 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4595436 | |
| e | 3928085 | |
| n | 2781382 | |
| r | 2730516 | |
| i | 2325435 | 7.3% |
| l | 2225854 | 7.0% |
| o | 2148462 | 6.8% |
| t | 1877320 | 5.9% |
| s | 1332502 | 4.2% |
| d | 1259565 | 4.0% |
| Other values (150) | 6490236 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 727811 | 11.4% |
| C | 496367 | 7.8% |
| P | 454929 | 7.1% |
| B | 406951 | 6.4% |
| A | 366394 | 5.7% |
| G | 358728 | 5.6% |
| R | 342416 | 5.4% |
| M | 329036 | 5.1% |
| L | 317771 | 5.0% |
| I | 311869 | 4.9% |
| Other values (107) | 2277541 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1535166 | |
| . | 128766 | 7.3% |
| / | 50412 | 2.9% |
| ' | 35648 | 2.0% |
| & | 2870 | 0.2% |
| : | 836 | < 0.1% |
| # | 368 | < 0.1% |
| " | 231 | < 0.1% |
| ; | 132 | < 0.1% |
| \ | 99 | < 0.1% |
| Other values (11) | 237 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1310960 | |
| 1 | 1105455 | |
| 2 | 773861 | |
| 3 | 645141 | |
| 4 | 559714 | |
| 5 | 519944 | 7.9% |
| 6 | 447462 | 6.8% |
| 8 | 438512 | 6.7% |
| 7 | 436993 | 6.6% |
| 9 | 353176 | 5.4% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̈ | 68 | |
| ́ | 67 | |
| ̀ | 14 | 8.2% |
| ̌ | 9 | 5.3% |
| ̃ | 5 | 2.9% |
| ̂ | 3 | 1.8% |
| ̊ | 2 | 1.2% |
| ̧ | 1 | 0.6% |
| ̨ | 1 | 0.6% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 291 | |
| | | 84 | 21.4% |
| ± | 7 | 1.8% |
| = | 3 | 0.8% |
| < | 2 | 0.5% |
| > | 2 | 0.5% |
| ⁄ | 2 | 0.5% |
| ~ | 1 | 0.3% |
Other Letter
| Value | Count | Frequency (%) |
| º | 3136 | |
| ª | 89 | 2.8% |
| く | 1 | < 0.1% |
| 近 | 1 | < 0.1% |
| ぐ | 1 | < 0.1% |
| す | 1 | < 0.1% |
| の | 1 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1277 | |
| © | 38 | 2.8% |
| � | 21 | 1.6% |
| № | 11 | 0.8% |
|  | 4 | 0.3% |
| ® | 1 | 0.1% |
| ¦ | 1 | 0.1% |
Format
| Value | Count | Frequency (%) |
| | 78 | |
| | 60 | |
| | 27 | 14.9% |
| | 8 | 4.4% |
| | 4 | 2.2% |
| | 3 | 1.7% |
| | 1 | 0.6% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 117 | |
| ` | 21 | 14.1% |
| ¨ | 7 | 4.7% |
| ^ | 2 | 1.3% |
| ΄ | 1 | 0.7% |
| ˚ | 1 | 0.7% |
Control
| Value | Count | Frequency (%) |
| 11 | ||
| 5 | ||
| | 2 | 10.0% |
| | 1 | 5.0% |
| | 1 | 5.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 271795 | |
| – | 340 | 0.1% |
| ‐ | 3 | < 0.1% |
| — | 1 | < 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ³ | 10 | |
| ⁰ | 3 | 18.8% |
| ⁷ | 2 | 12.5% |
| ½ | 1 | 6.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1825 | |
| } | 1 | 0.1% |
| ] | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1812 | |
| „ | 6 | 0.3% |
| [ | 1 | 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1235 | |
| ” | 25 | 2.0% |
| » | 9 | 0.7% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 31 | |
| ‘ | 24 | |
| « | 9 | 14.1% |
Space Separator
| Value | Count | Frequency (%) |
| 7172662 | ||
| 32 | < 0.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 2 | |
| ᵒ | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 1 | |
| £ | 1 |
Letter Number
| Value | Count | Frequency (%) |
| Ⅲ | 1 | |
| Ⅰ | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 32 |
Line Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38082428 | |
| Common | 15797943 | |
| Greek | 4994 | < 0.1% |
| Cyrillic | 414 | < 0.1% |
| Inherited | 170 | < 0.1% |
| Hiragana | 4 | < 0.1% |
| Han | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4595436 | 12.1% |
| e | 3928085 | 10.3% |
| n | 2781382 | 7.3% |
| r | 2730516 | 7.2% |
| i | 2325435 | 6.1% |
| l | 2225854 | 5.8% |
| o | 2148462 | 5.6% |
| t | 1877320 | 4.9% |
| s | 1332502 | 3.5% |
| d | 1259565 | 3.3% |
| Other values (166) | 12877871 |
Common
| Value | Count | Frequency (%) |
| 7172662 | ||
| , | 1535166 | 9.7% |
| 0 | 1310960 | 8.3% |
| 1 | 1105455 | 7.0% |
| 2 | 773861 | 4.9% |
| 3 | 645141 | 4.1% |
| 4 | 559714 | 3.5% |
| 5 | 519944 | 3.3% |
| 6 | 447462 | 2.8% |
| 8 | 438512 | 2.8% |
| Other values (80) | 1289066 | 8.2% |
Greek
| Value | Count | Frequency (%) |
| α | 501 | 10.0% |
| ο | 434 | 8.7% |
| ρ | 299 | 6.0% |
| υ | 266 | 5.3% |
| ι | 234 | 4.7% |
| ν | 232 | 4.6% |
| λ | 192 | 3.8% |
| ε | 185 | 3.7% |
| τ | 185 | 3.7% |
| ς | 161 | 3.2% |
| Other values (51) | 2305 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 45 | 10.9% |
| и | 33 | 8.0% |
| о | 31 | 7.5% |
| р | 30 | 7.2% |
| л | 28 | 6.8% |
| е | 23 | 5.6% |
| у | 20 | 4.8% |
| н | 17 | 4.1% |
| с | 17 | 4.1% |
| в | 17 | 4.1% |
| Other values (36) | 153 |
Inherited
| Value | Count | Frequency (%) |
| ̈ | 68 | |
| ́ | 67 | |
| ̀ | 14 | 8.2% |
| ̌ | 9 | 5.3% |
| ̃ | 5 | 2.9% |
| ̂ | 3 | 1.8% |
| ̊ | 2 | 1.2% |
| ̧ | 1 | 0.6% |
| ̨ | 1 | 0.6% |
Hiragana
| Value | Count | Frequency (%) |
| く | 1 | |
| ぐ | 1 | |
| す | 1 | |
| の | 1 |
Han
| Value | Count | Frequency (%) |
| 近 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 53783852 | |
| None | 99634 | 0.2% |
| Punctuation | 1834 | < 0.1% |
| Cyrillic | 414 | < 0.1% |
| Diacriticals | 170 | < 0.1% |
| Specials | 25 | < 0.1% |
| Letterlike Symbols | 11 | < 0.1% |
| Hiragana | 4 | < 0.1% |
| Modifier Letters | 3 | < 0.1% |
| Phonetic Ext | 2 | < 0.1% |
| Other values (3) | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7172662 | 13.3% | |
| a | 4595436 | 8.5% |
| e | 3928085 | 7.3% |
| n | 2781382 | 5.2% |
| r | 2730516 | 5.1% |
| i | 2325435 | 4.3% |
| l | 2225854 | 4.1% |
| o | 2148462 | 4.0% |
| t | 1877320 | 3.5% |
| , | 1535166 | 2.9% |
| Other values (86) | 22463534 |
None
| Value | Count | Frequency (%) |
| ü | 21699 | |
| é | 9424 | 9.5% |
| á | 6118 | 6.1% |
| í | 5701 | 5.7% |
| ß | 5606 | 5.6% |
| ä | 5169 | 5.2% |
| ö | 5011 | 5.0% |
| ó | 4250 | 4.3% |
| º | 3136 | 3.1% |
| à | 2817 | 2.8% |
| Other values (193) | 30703 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 1235 | |
| – | 340 | 18.5% |
| | 78 | 4.3% |
| | 60 | 3.3% |
| “ | 31 | 1.7% |
| ” | 25 | 1.4% |
| ‘ | 24 | 1.3% |
| | 8 | 0.4% |
| • | 8 | 0.4% |
| „ | 6 | 0.3% |
| Other values (8) | 19 | 1.0% |
Diacriticals
| Value | Count | Frequency (%) |
| ̈ | 68 | |
| ́ | 67 | |
| ̀ | 14 | 8.2% |
| ̌ | 9 | 5.3% |
| ̃ | 5 | 2.9% |
| ̂ | 3 | 1.8% |
| ̊ | 2 | 1.2% |
| ̧ | 1 | 0.6% |
| ̨ | 1 | 0.6% |
Cyrillic
| Value | Count | Frequency (%) |
| а | 45 | 10.9% |
| и | 33 | 8.0% |
| о | 31 | 7.5% |
| р | 30 | 7.2% |
| л | 28 | 6.8% |
| е | 23 | 5.6% |
| у | 20 | 4.8% |
| н | 17 | 4.1% |
| с | 17 | 4.1% |
| в | 17 | 4.1% |
| Other values (36) | 153 |
Specials
| Value | Count | Frequency (%) |
| � | 21 | |
|  | 4 | 16.0% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 11 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 2 | |
| ˚ | 1 |
Phonetic Ext
| Value | Count | Frequency (%) |
| ᵒ | 2 |
Hiragana
| Value | Count | Frequency (%) |
| く | 1 | |
| ぐ | 1 | |
| す | 1 | |
| の | 1 |
Number Forms
| Value | Count | Frequency (%) |
| Ⅲ | 1 | |
| Ⅰ | 1 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ṓ | 1 | |
| ḯ | 1 |
CJK
| Value | Count | Frequency (%) |
| 近 | 1 |
| Distinct | 857920 |
|---|---|
| Distinct (%) | 80.4% |
| Missing | 15790 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.56718192 |
| Minimum | 27.64031 |
|---|---|
| Maximum | 69.94156 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 27.64031 |
|---|---|
| 5-th percentile | 37.3463932 |
| Q1 | 41.90986 |
| median | 46.5851 |
| Q3 | 51.4053675 |
| 95-th percentile | 54.8690489 |
| Maximum | 69.94156 |
| Range | 42.30125 |
| Interquartile range (IQR) | 9.4955075 |
Descriptive statistics
| Standard deviation | 5.882611005 |
|---|---|
| Coefficient of variation (CV) | 0.1263252523 |
| Kurtosis | -0.06035665257 |
| Mean | 46.56718192 |
| Median Absolute Deviation (MAD) | 4.73456 |
| Skewness | -0.2063293143 |
| Sum | 49715449.38 |
| Variance | 34.60511223 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 41.00986 | 175 | < 0.1% |
| 51.12472 | 140 | < 0.1% |
| 45.003227 | 112 | < 0.1% |
| 52.41879 | 72 | < 0.1% |
| 40.429913 | 54 | < 0.1% |
| 55.62955 | 42 | < 0.1% |
| 46.698483 | 40 | < 0.1% |
| 40.09095 | 37 | < 0.1% |
| 41.40035 | 34 | < 0.1% |
| 51.12626 | 33 | < 0.1% |
| Other values (857910) | 1066868 | |
| (Missing) | 15790 | 1.5% |
| Value | Count | Frequency (%) |
| 27.64031 | 1 | |
| 27.64053 | 1 | |
| 27.64057 | 1 | |
| 27.640947 | 1 | |
| 27.640959 | 1 | |
| 27.64106 | 1 | |
| 27.64141 | 1 | |
| 27.641466 | 1 | |
| 27.641989 | 1 | |
| 27.699686 | 1 |
| Value | Count | Frequency (%) |
| 69.94156 | 1 | |
| 69.926575 | 1 | |
| 69.907166 | 1 | |
| 69.90655 | 1 | |
| 69.89453 | 1 | |
| 69.50826 | 1 | |
| 69.39988 | 1 | |
| 69.399185 | 1 | |
| 69.39917 | 1 | |
| 69.25132 | 1 |
| Distinct | 969586 |
|---|---|
| Distinct (%) | 90.8% |
| Missing | 15790 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.838040356 |
| Minimum | -71.218094 |
|---|---|
| Maximum | 33.369423 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 335442 |
| Negative (%) | 31.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | -71.218094 |
|---|---|
| 5-th percentile | -7.6241437 |
| Q1 | -0.8027315 |
| median | 5.64653 |
| Q3 | 12.2376745 |
| 95-th percentile | 20.9910173 |
| Maximum | 33.369423 |
| Range | 104.587517 |
| Interquartile range (IQR) | 13.040406 |
Descriptive statistics
| Standard deviation | 8.639410037 |
|---|---|
| Coefficient of variation (CV) | 1.479847605 |
| Kurtosis | -0.2564410915 |
| Mean | 5.838040356 |
| Median Absolute Deviation (MAD) | 6.529056 |
| Skewness | 0.1112689221 |
| Sum | 6232732.751 |
| Variance | 74.63940579 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 28.95707 | 175 | < 0.1% |
| -2.74 | 140 | < 0.1% |
| 11.083693 | 112 | < 0.1% |
| -1.247349 | 72 | < 0.1% |
| -3.669245 | 52 | < 0.1% |
| 9.20105 | 41 | < 0.1% |
| 2.549047 | 40 | < 0.1% |
| -3.464618 | 37 | < 0.1% |
| 2.159592 | 34 | < 0.1% |
| -2.740861 | 33 | < 0.1% |
| Other values (969576) | 1066871 | |
| (Missing) | 15790 | 1.5% |
| Value | Count | Frequency (%) |
| -71.218094 | 1 | |
| -31.265543 | 1 | |
| -31.263672 | 1 | |
| -31.26259 | 1 | |
| -31.261559 | 1 | |
| -31.256433 | 1 | |
| -31.255907 | 1 | |
| -31.209183 | 1 | |
| -31.186285 | 1 | |
| -31.17934 | 1 |
| Value | Count | Frequency (%) |
| 33.369423 | 1 | |
| 33.28716 | 1 | |
| 30.938726 | 1 | |
| 30.933966 | 1 | |
| 30.92968 | 1 | |
| 30.929373 | 1 | |
| 30.9291 | 1 | |
| 30.91931 | 1 | |
| 30.337452 | 1 | |
| 30.336582 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1842 |
| Missing (%) | 0.2% |
| Memory size | 67.2 MiB |
| Unclaimed | |
|---|---|
| Claimed |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.122751964 |
| Min length | 7 |
Characters and Unicode
| Total characters | 8785203 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Claimed |
|---|---|
| 2nd row | Unclaimed |
| 3rd row | Claimed |
| 4th row | Claimed |
| 5th row | Unclaimed |
Common Values
| Value | Count | Frequency (%) |
| Unclaimed | 607159 | |
| Claimed | 474396 | |
| (Missing) | 1842 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| unclaimed | 607159 | |
| claimed | 474396 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1081555 | |
| a | 1081555 | |
| i | 1081555 | |
| m | 1081555 | |
| e | 1081555 | |
| d | 1081555 | |
| U | 607159 | |
| n | 607159 | |
| c | 607159 | |
| C | 474396 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7703648 | |
| Uppercase Letter | 1081555 | 12.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1081555 | |
| a | 1081555 | |
| i | 1081555 | |
| m | 1081555 | |
| e | 1081555 | |
| d | 1081555 | |
| n | 607159 | |
| c | 607159 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 607159 | |
| C | 474396 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8785203 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1081555 | |
| a | 1081555 | |
| i | 1081555 | |
| m | 1081555 | |
| e | 1081555 | |
| d | 1081555 | |
| U | 607159 | |
| n | 607159 | |
| c | 607159 | |
| C | 474396 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8785203 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1081555 | |
| a | 1081555 | |
| i | 1081555 | |
| m | 1081555 | |
| e | 1081555 | |
| d | 1081555 | |
| U | 607159 | |
| n | 607159 | |
| c | 607159 | |
| C | 474396 |
| Distinct | 917 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 820264 |
| Missing (%) | 75.7% |
| Memory size | 67.8 MiB |
| Travellers' Choice, Certificate of Excellence 2020 | |
|---|---|
| Certificate of Excellence 2017 | 16392 |
| Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019 | 15994 |
| Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017, Certificate of Excellence 2016 | 13935 |
| Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017 | 13380 |
| Other values (912) |
Length
| Max length | 380 |
|---|---|
| Median length | 94 |
| Mean length | 113.6036377 |
| Min length | 30 |
Characters and Unicode
| Total characters | 29892866 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 156 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Travellers' Choice, Certificate of Excellence 2020 |
|---|---|
| 2nd row | Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017 |
| 3rd row | Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017, Certificate of Excellence 2016, Certificate of Excellence 2015 |
| 4th row | Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017, Certificate of Excellence 2016, Certificate of Excellence 2015 |
| 5th row | Certificate of Excellence 2018, Certificate of Excellence 2017 |
Common Values
| Value | Count | Frequency (%) |
| Travellers' Choice, Certificate of Excellence 2020 | 20868 | 1.9% |
| Certificate of Excellence 2017 | 16392 | 1.5% |
| Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019 | 15994 | 1.5% |
| Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017, Certificate of Excellence 2016 | 13935 | 1.3% |
| Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017 | 13380 | 1.2% |
| Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018 | 11988 | 1.1% |
| Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017, Certificate of Excellence 2016, Certificate of Excellence 2015 | 11107 | 1.0% |
| Certificate of Excellence 2018 | 10011 | 0.9% |
| Certificate of Excellence 2019 | 9446 | 0.9% |
| Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017, Certificate of Excellence 2016, Certificate of Excellence 2015, Certificate of Excellence 2014 | 8367 | 0.8% |
| Other values (907) | 131645 | 12.2% |
| (Missing) | 820264 |
Length
| Value | Count | Frequency (%) |
| of | 824284 | |
| certificate | 824119 | |
| excellence | 824119 | |
| travellers | 147648 | 3.9% |
| choice | 147648 | 3.9% |
| 2020 | 144978 | 3.9% |
| 2017 | 142887 | 3.8% |
| 2019 | 139995 | 3.7% |
| 2018 | 138741 | 3.7% |
| 2016 | 107686 | 2.9% |
| Other values (38) | 306349 | 8.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4638105 | |
| 3485321 | ||
| c | 2664303 | 8.9% |
| l | 1998943 | 6.7% |
| i | 1880691 | 6.3% |
| t | 1694431 | 5.7% |
| f | 1676984 | 5.6% |
| r | 1178000 | 3.9% |
| o | 1056720 | 3.5% |
| 2 | 1019844 | 3.4% |
| Other values (43) | 8599524 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20110514 | |
| Space Separator | 3485321 | 11.7% |
| Decimal Number | 3384676 | 11.3% |
| Uppercase Letter | 2017106 | 6.7% |
| Other Punctuation | 895249 | 3.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4638105 | |
| c | 2664303 | |
| l | 1998943 | |
| i | 1880691 | |
| t | 1694431 | 8.4% |
| f | 1676984 | 8.3% |
| r | 1178000 | 5.9% |
| o | 1056720 | 5.3% |
| a | 1018786 | 5.1% |
| n | 881107 | 4.4% |
| Other values (15) | 1422444 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 976604 | |
| E | 824648 | |
| T | 155142 | 7.7% |
| M | 31130 | 1.5% |
| G | 8978 | 0.4% |
| P | 7162 | 0.4% |
| S | 6339 | 0.3% |
| B | 2146 | 0.1% |
| O | 1672 | 0.1% |
| H | 1672 | 0.1% |
| Other values (2) | 1613 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1019844 | |
| 0 | 991147 | |
| 1 | 705526 | |
| 7 | 142887 | 4.2% |
| 9 | 139995 | 4.1% |
| 8 | 138741 | 4.1% |
| 6 | 107686 | 3.2% |
| 5 | 66340 | 2.0% |
| 4 | 41445 | 1.2% |
| 3 | 31065 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 734418 | |
| ' | 147648 | 16.5% |
| : | 10982 | 1.2% |
| ! | 2004 | 0.2% |
| . | 197 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3485321 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22127620 | |
| Common | 7765246 | 26.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4638105 | |
| c | 2664303 | |
| l | 1998943 | |
| i | 1880691 | |
| t | 1694431 | 7.7% |
| f | 1676984 | 7.6% |
| r | 1178000 | 5.3% |
| o | 1056720 | 4.8% |
| a | 1018786 | 4.6% |
| C | 976604 | 4.4% |
| Other values (27) | 3344053 |
Common
| Value | Count | Frequency (%) |
| 3485321 | ||
| 2 | 1019844 | 13.1% |
| 0 | 991147 | 12.8% |
| , | 734418 | 9.5% |
| 1 | 705526 | 9.1% |
| ' | 147648 | 1.9% |
| 7 | 142887 | 1.8% |
| 9 | 139995 | 1.8% |
| 8 | 138741 | 1.8% |
| 6 | 107686 | 1.4% |
| Other values (6) | 152033 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29892866 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4638105 | |
| 3485321 | ||
| c | 2664303 | 8.9% |
| l | 1998943 | 6.7% |
| i | 1880691 | 6.3% |
| t | 1694431 | 5.7% |
| f | 1676984 | 5.6% |
| r | 1178000 | 3.9% |
| o | 1056720 | 3.5% |
| 2 | 1019844 | 3.4% |
| Other values (43) | 8599524 |
| Distinct | 981409 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 94988 |
| Missing (%) | 8.8% |
| Memory size | 89.5 MiB |
| #7616 of 8661 Restaurants in Barcelona | 119 |
|---|---|
| #8393 of 10193 Restaurants in Madrid | 99 |
| #4081 of 4632 Restaurants in Prague | 90 |
| #5951 of 6682 Restaurants in Milan | 89 |
| #15227 of 17023 Restaurants in London | 85 |
| Other values (981404) |
Length
| Max length | 73 |
|---|---|
| Median length | 34 |
| Mean length | 34.86049095 |
| Min length | 21 |
Characters and Unicode
| Total characters | 34456423 |
|---|---|
| Distinct characters | 72 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 977358 ? |
|---|---|
| Unique (%) | 98.9% |
Sample
| 1st row | #1 of 2 Restaurants in Saint-Jouvent |
|---|---|
| 2nd row | #2 of 2 Restaurants in Saint-Jouvent |
| 3rd row | #1 of 1 Restaurant in Rivarennes |
| 4th row | #1 of 1 Restaurant in Lacelle |
| 5th row | #1 of 1 Restaurant in Saint-Laurent-de-Levezou |
Common Values
| Value | Count | Frequency (%) |
| #7616 of 8661 Restaurants in Barcelona | 119 | < 0.1% |
| #8393 of 10193 Restaurants in Madrid | 99 | < 0.1% |
| #4081 of 4632 Restaurants in Prague | 90 | < 0.1% |
| #5951 of 6682 Restaurants in Milan | 89 | < 0.1% |
| #15227 of 17023 Restaurants in London | 85 | < 0.1% |
| #9123 of 10232 Restaurants in Rome | 84 | < 0.1% |
| #14004 of 15476 Restaurants in Paris | 75 | < 0.1% |
| #715 of 860 Restaurants in Brno | 44 | < 0.1% |
| #5233 of 5605 Restaurants in Berlin | 43 | < 0.1% |
| #3443 of 3747 Restaurants in Vienna | 43 | < 0.1% |
| Other values (981399) | 987638 | |
| (Missing) | 94988 | 8.8% |
Length
| Value | Count | Frequency (%) |
| in | 990557 | 15.7% |
| of | 988924 | 15.7% |
| restaurants | 871237 | 13.8% |
| 1 | 122519 | 1.9% |
| 2 | 78697 | 1.2% |
| 3 | 60245 | 1.0% |
| 4 | 49179 | 0.8% |
| 42785 | 0.7% | |
| 5 | 42522 | 0.7% |
| tea | 40643 | 0.6% |
| Other values (74995) | 3030689 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5329632 | ||
| a | 2784779 | 8.1% |
| n | 2544946 | 7.4% |
| s | 2248410 | 6.5% |
| t | 2204087 | 6.4% |
| e | 1972740 | 5.7% |
| o | 1713724 | 5.0% |
| i | 1615019 | 4.7% |
| r | 1557113 | 4.5% |
| u | 1176463 | 3.4% |
| Other values (62) | 11309510 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20990548 | |
| Space Separator | 5329632 | 15.5% |
| Decimal Number | 4669826 | 13.6% |
| Uppercase Letter | 2328022 | 6.8% |
| Other Punctuation | 1051148 | 3.1% |
| Dash Punctuation | 85155 | 0.2% |
| Open Punctuation | 1046 | < 0.1% |
| Close Punctuation | 1046 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2784779 | |
| n | 2544946 | |
| s | 2248410 | |
| t | 2204087 | |
| e | 1972740 | |
| o | 1713724 | |
| i | 1615019 | |
| r | 1557113 | |
| u | 1176463 | |
| f | 1125780 | |
| Other values (16) | 2047487 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 946791 | |
| C | 149806 | 6.4% |
| B | 148594 | 6.4% |
| S | 145163 | 6.2% |
| M | 114316 | 4.9% |
| L | 103998 | 4.5% |
| P | 101273 | 4.4% |
| T | 97016 | 4.2% |
| A | 75811 | 3.3% |
| D | 55609 | 2.4% |
| Other values (16) | 389645 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 966862 | |
| 2 | 637412 | |
| 3 | 521278 | |
| 4 | 442525 | |
| 5 | 404063 | |
| 6 | 403163 | |
| 7 | 351050 | 7.5% |
| 0 | 331517 | 7.1% |
| 8 | 318775 | 6.8% |
| 9 | 293181 | 6.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 988409 | |
| & | 41881 | 4.0% |
| ' | 10340 | 1.0% |
| \ | 9919 | 0.9% |
| . | 536 | 0.1% |
| / | 63 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 5329632 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 85155 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1046 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1046 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23318570 | |
| Common | 11137853 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2784779 | |
| n | 2544946 | |
| s | 2248410 | |
| t | 2204087 | |
| e | 1972740 | |
| o | 1713724 | 7.3% |
| i | 1615019 | 6.9% |
| r | 1557113 | 6.7% |
| u | 1176463 | 5.0% |
| f | 1125780 | 4.8% |
| Other values (42) | 4375509 |
Common
| Value | Count | Frequency (%) |
| 5329632 | ||
| # | 988409 | 8.9% |
| 1 | 966862 | 8.7% |
| 2 | 637412 | 5.7% |
| 3 | 521278 | 4.7% |
| 4 | 442525 | 4.0% |
| 5 | 404063 | 3.6% |
| 6 | 403163 | 3.6% |
| 7 | 351050 | 3.2% |
| 0 | 331517 | 3.0% |
| Other values (10) | 761942 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34456423 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5329632 | ||
| a | 2784779 | 8.1% |
| n | 2544946 | 7.4% |
| s | 2248410 | 6.5% |
| t | 2204087 | 6.4% |
| e | 1972740 | 5.7% |
| o | 1713724 | 5.0% |
| i | 1615019 | 4.7% |
| r | 1557113 | 4.5% |
| u | 1176463 | 3.4% |
| Other values (62) | 11309510 |
| Distinct | 981940 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 97792 |
| Missing (%) | 9.0% |
| Memory size | 91.4 MiB |
| #1 of 1 places to eat in Agios Ioannis | 6 |
|---|---|
| #1 of 1 places to eat in Weston | 5 |
| #1 of 1 places to eat in Clifton | 5 |
| #1 of 1 places to eat in Spilia | 4 |
| #1 of 1 places to eat in Platanos | 4 |
| Other values (981935) |
Length
| Max length | 77 |
|---|---|
| Median length | 36 |
| Mean length | 37.11203373 |
| Min length | 27 |
Characters and Unicode
| Total characters | 36577806 |
|---|---|
| Distinct characters | 72 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 978351 ? |
|---|---|
| Unique (%) | 99.3% |
Sample
| 1st row | #1 of 2 places to eat in Saint-Jouvent |
|---|---|
| 2nd row | #2 of 2 places to eat in Saint-Jouvent |
| 3rd row | #1 of 1 places to eat in Rivarennes |
| 4th row | #1 of 1 places to eat in Lacelle |
| 5th row | #1 of 1 places to eat in Saint-Laurent-de-Levezou |
Common Values
| Value | Count | Frequency (%) |
| #1 of 1 places to eat in Agios Ioannis | 6 | < 0.1% |
| #1 of 1 places to eat in Weston | 5 | < 0.1% |
| #1 of 1 places to eat in Clifton | 5 | < 0.1% |
| #1 of 1 places to eat in Spilia | 4 | < 0.1% |
| #1 of 1 places to eat in Platanos | 4 | < 0.1% |
| #1 of 1 places to eat in Atalaia | 4 | < 0.1% |
| #1 of 1 places to eat in Saint-Symphorien | 4 | < 0.1% |
| #1 of 1 places to eat in Saint-Sauveur | 4 | < 0.1% |
| #1 of 1 places to eat in Buch | 4 | < 0.1% |
| #1 of 1 places to eat in Baron | 3 | < 0.1% |
| Other values (981930) | 985562 | |
| (Missing) | 97792 | 9.0% |
Length
| Value | Count | Frequency (%) |
| in | 987740 | 12.1% |
| of | 986099 | 12.1% |
| places | 985605 | 12.1% |
| to | 985605 | 12.1% |
| eat | 985605 | 12.1% |
| 1 | 80595 | 1.0% |
| 2 | 59004 | 0.7% |
| 3 | 47147 | 0.6% |
| 4 | 40126 | 0.5% |
| 5 | 35309 | 0.4% |
| Other values (77819) | 2947669 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7154943 | ||
| a | 2915828 | 8.0% |
| e | 2845105 | 7.8% |
| o | 2632821 | 7.2% |
| t | 2321189 | 6.3% |
| n | 1647897 | 4.5% |
| i | 1557690 | 4.3% |
| l | 1467477 | 4.0% |
| s | 1375691 | 3.8% |
| c | 1167162 | 3.2% |
| Other values (62) | 11492003 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22132755 | |
| Space Separator | 7154943 | 19.6% |
| Decimal Number | 4970082 | 13.6% |
| Uppercase Letter | 1236380 | 3.4% |
| Other Punctuation | 996652 | 2.7% |
| Dash Punctuation | 84908 | 0.2% |
| Open Punctuation | 1043 | < 0.1% |
| Close Punctuation | 1043 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2915828 | |
| e | 2845105 | |
| o | 2632821 | |
| t | 2321189 | |
| n | 1647897 | |
| i | 1557690 | |
| l | 1467477 | |
| s | 1375691 | |
| c | 1167162 | 5.3% |
| p | 1056259 | 4.8% |
| Other values (16) | 3145636 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 122731 | 9.9% |
| B | 115878 | 9.4% |
| M | 112984 | 9.1% |
| C | 105524 | 8.5% |
| L | 103689 | 8.4% |
| P | 99799 | 8.1% |
| A | 75593 | 6.1% |
| T | 56243 | 4.5% |
| R | 54228 | 4.4% |
| V | 51783 | 4.2% |
| Other values (16) | 337928 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 969273 | |
| 2 | 694484 | |
| 3 | 573121 | |
| 4 | 504651 | |
| 5 | 429869 | |
| 6 | 427514 | |
| 8 | 367854 | 7.4% |
| 7 | 356996 | 7.2% |
| 0 | 327989 | 6.6% |
| 9 | 318331 | 6.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 985605 | |
| ' | 10315 | 1.0% |
| . | 536 | 0.1% |
| \ | 133 | < 0.1% |
| / | 62 | < 0.1% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 7154943 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 84908 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1043 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1043 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23369135 | |
| Common | 13208671 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2915828 | |
| e | 2845105 | |
| o | 2632821 | |
| t | 2321189 | |
| n | 1647897 | 7.1% |
| i | 1557690 | 6.7% |
| l | 1467477 | 6.3% |
| s | 1375691 | 5.9% |
| c | 1167162 | 5.0% |
| p | 1056259 | 4.5% |
| Other values (42) | 4382016 |
Common
| Value | Count | Frequency (%) |
| 7154943 | ||
| # | 985605 | 7.5% |
| 1 | 969273 | 7.3% |
| 2 | 694484 | 5.3% |
| 3 | 573121 | 4.3% |
| 4 | 504651 | 3.8% |
| 5 | 429869 | 3.3% |
| 6 | 427514 | 3.2% |
| 8 | 367854 | 2.8% |
| 7 | 356996 | 2.7% |
| Other values (10) | 744361 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36577806 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7154943 | ||
| a | 2915828 | 8.0% |
| e | 2845105 | 7.8% |
| o | 2632821 | 7.2% |
| t | 2321189 | 6.3% |
| n | 1647897 | 4.5% |
| i | 1557690 | 4.3% |
| l | 1467477 | 4.0% |
| s | 1375691 | 3.8% |
| c | 1167162 | 3.2% |
| Other values (62) | 11492003 |
| Distinct | 39962 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 110634 |
| Missing (%) | 10.2% |
| Memory size | 84.0 MiB |
| Mid-range, French | 20211 |
|---|---|
| Mid-range | 19422 |
| Cheap Eats | 15864 |
| Mid-range, Italian | 14363 |
| Italian | 14103 |
| Other values (39957) |
Length
| Max length | 75 |
|---|---|
| Median length | 32 |
| Mean length | 29.9151448 |
| Min length | 3 |
Characters and Unicode
| Total characters | 29100346 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20177 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | Cheap Eats, French |
|---|---|
| 2nd row | Cheap Eats |
| 3rd row | Cheap Eats, French, European |
| 4th row | Cheap Eats, French |
| 5th row | Mid-range, French |
Common Values
| Value | Count | Frequency (%) |
| Mid-range, French | 20211 | 1.9% |
| Mid-range | 19422 | 1.8% |
| Cheap Eats | 15864 | 1.5% |
| Mid-range, Italian | 14363 | 1.3% |
| Italian | 14103 | 1.3% |
| Mid-range, Bar, British, Pub | 13566 | 1.3% |
| Mid-range, Italian, Seafood, Mediterranean | 13414 | 1.2% |
| Mid-range, Italian, Pizza, Mediterranean | 12562 | 1.2% |
| Mid-range, Italian, Pizza, Seafood | 12099 | 1.1% |
| Cafe | 10613 | 1.0% |
| Other values (39952) | 826546 | |
| (Missing) | 110634 | 10.2% |
Length
| Value | Count | Frequency (%) |
| mid-range | 538207 | 15.2% |
| cheap | 240351 | 6.8% |
| eats | 240351 | 6.8% |
| italian | 236822 | 6.7% |
| european | 192287 | 5.4% |
| mediterranean | 155388 | 4.4% |
| friendly | 133520 | 3.8% |
| vegetarian | 133520 | 3.8% |
| pizza | 113259 | 3.2% |
| cafe | 107793 | 3.0% |
| Other values (200) | 1456411 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3345414 | 11.5% |
| e | 2867306 | 9.9% |
| 2575177 | 8.8% | |
| n | 2357873 | 8.1% |
| i | 2087983 | 7.2% |
| , | 1994966 | 6.9% |
| r | 1944584 | 6.7% |
| t | 1227298 | 4.2% |
| d | 1021411 | 3.5% |
| s | 817661 | 2.8% |
| Other values (47) | 8860673 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20485901 | |
| Uppercase Letter | 3495787 | 12.0% |
| Space Separator | 2575177 | 8.8% |
| Other Punctuation | 1995467 | 6.9% |
| Dash Punctuation | 548014 | 1.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3345414 | |
| e | 2867306 | |
| n | 2357873 | |
| i | 2087983 | |
| r | 1944584 | |
| t | 1227298 | 6.0% |
| d | 1021411 | 5.0% |
| s | 817661 | 4.0% |
| g | 755010 | 3.7% |
| o | 657479 | 3.2% |
| Other values (16) | 3403882 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 714263 | |
| E | 446913 | |
| C | 417197 | |
| F | 339000 | |
| I | 308315 | |
| B | 259214 | 7.4% |
| S | 225394 | 6.4% |
| P | 206005 | 5.9% |
| V | 155389 | 4.4% |
| A | 94820 | 2.7% |
| Other values (16) | 329277 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1994966 | |
| & | 495 | < 0.1% |
| / | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2575177 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 548014 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23981688 | |
| Common | 5118658 | 17.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3345414 | |
| e | 2867306 | |
| n | 2357873 | 9.8% |
| i | 2087983 | 8.7% |
| r | 1944584 | 8.1% |
| t | 1227298 | 5.1% |
| d | 1021411 | 4.3% |
| s | 817661 | 3.4% |
| g | 755010 | 3.1% |
| M | 714263 | 3.0% |
| Other values (42) | 6842885 |
Common
| Value | Count | Frequency (%) |
| 2575177 | ||
| , | 1994966 | |
| - | 548014 | 10.7% |
| & | 495 | < 0.1% |
| / | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29100346 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3345414 | 11.5% |
| e | 2867306 | 9.9% |
| 2575177 | 8.8% | |
| n | 2357873 | 8.1% |
| i | 2087983 | 7.2% |
| , | 1994966 | 6.9% |
| r | 1944584 | 6.7% |
| t | 1227298 | 4.2% |
| d | 1021411 | 3.5% |
| s | 817661 | 2.8% |
| Other values (47) | 8860673 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 277205 |
| Missing (%) | 25.6% |
| Memory size | 78.3 MiB |
| €€-€€€ | |
|---|---|
| € | |
| €€€€ | 28069 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 4.440615883 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3579989 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | € |
|---|---|
| 2nd row | € |
| 3rd row | € |
| 4th row | € |
| 5th row | €€-€€€ |
Common Values
| Value | Count | Frequency (%) |
| €€-€€€ | 537918 | |
| € | 240205 | |
| €€€€ | 28069 | 2.6% |
| (Missing) | 277205 |
Length
Pie chart
| Value | Count | Frequency (%) |
| €€-€€€ | 537918 | |
| € | 240205 | |
| €€€€ | 28069 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| € | 3042071 | |
| - | 537918 | 15.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Currency Symbol | 3042071 | |
| Dash Punctuation | 537918 | 15.0% |
Most frequent character per category
Currency Symbol
| Value | Count | Frequency (%) |
| € | 3042071 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 537918 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3579989 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| € | 3042071 | |
| - | 537918 | 15.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Currency Symbols | 3042071 | |
| ASCII | 537918 | 15.0% |
Most frequent character per block
Currency Symbols
| Value | Count | Frequency (%) |
| € | 3042071 |
ASCII
| Value | Count | Frequency (%) |
| - | 537918 |
| Distinct | 7298 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 779070 |
| Missing (%) | 71.9% |
| Memory size | 51.2 MiB |
| €10-€30 | 5937 |
|---|---|
| €5-€15 | 5810 |
| €10-€20 | 5148 |
| €5-€20 | 4793 |
| €10-€25 | 4448 |
| Other values (7293) |
Length
| Max length | 25 |
|---|---|
| Median length | 6 |
| Mean length | 6.417379989 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1952982 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 3478 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | €14-€29 |
|---|---|
| 2nd row | €8-€17 |
| 3rd row | €10-€35 |
| 4th row | €12-€26 |
| 5th row | €12-€30 |
Common Values
| Value | Count | Frequency (%) |
| €10-€30 | 5937 | 0.5% |
| €5-€15 | 5810 | 0.5% |
| €10-€20 | 5148 | 0.5% |
| €5-€20 | 4793 | 0.4% |
| €10-€25 | 4448 | 0.4% |
| €5-€10 | 3965 | 0.4% |
| €15-€30 | 3735 | 0.3% |
| €2-€10 | 3416 | 0.3% |
| €3-€10 | 3385 | 0.3% |
| €5-€25 | 2682 | 0.2% |
| Other values (7288) | 261008 | 24.1% |
| (Missing) | 779070 |
Length
| Value | Count | Frequency (%) |
| €10-€30 | 5937 | 1.9% |
| €5-€15 | 5810 | 1.9% |
| €10-€20 | 5148 | 1.6% |
| €5-€20 | 4793 | 1.5% |
| €10-€25 | 4448 | 1.4% |
| chf | 4207 | 1.3% |
| €5-€10 | 3965 | 1.3% |
| €15-€30 | 3735 | 1.2% |
| €2-€10 | 3416 | 1.1% |
| €3-€10 | 3385 | 1.1% |
| Other values (6577) | 267897 |
Most occurring characters
| Value | Count | Frequency (%) |
| € | 600240 | |
| - | 304327 | |
| 1 | 214923 | 11.0% |
| 2 | 171434 | 8.8% |
| 5 | 144512 | 7.4% |
| 0 | 142594 | 7.3% |
| 3 | 103178 | 5.3% |
| 4 | 62716 | 3.2% |
| 6 | 48522 | 2.5% |
| 8 | 46647 | 2.4% |
| Other values (7) | 113889 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1014069 | |
| Currency Symbol | 600240 | |
| Dash Punctuation | 304327 | 15.6% |
| Uppercase Letter | 25242 | 1.3% |
| Space Separator | 8414 | 0.4% |
| Other Punctuation | 690 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 214923 | |
| 2 | 171434 | |
| 5 | 144512 | |
| 0 | 142594 | |
| 3 | 103178 | |
| 4 | 62716 | 6.2% |
| 6 | 48522 | 4.8% |
| 8 | 46647 | 4.6% |
| 7 | 46198 | 4.6% |
| 9 | 33345 | 3.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 8414 | |
| H | 8414 | |
| F | 8414 |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 600240 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 304327 |
Space Separator
| Value | Count | Frequency (%) |
| 8414 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 690 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1927740 | |
| Latin | 25242 | 1.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| € | 600240 | |
| - | 304327 | |
| 1 | 214923 | 11.1% |
| 2 | 171434 | 8.9% |
| 5 | 144512 | 7.5% |
| 0 | 142594 | 7.4% |
| 3 | 103178 | 5.4% |
| 4 | 62716 | 3.3% |
| 6 | 48522 | 2.5% |
| 8 | 46647 | 2.4% |
| Other values (4) | 88647 | 4.6% |
Latin
| Value | Count | Frequency (%) |
| C | 8414 | |
| H | 8414 | |
| F | 8414 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1344328 | |
| Currency Symbols | 600240 | |
| None | 8414 | 0.4% |
Most frequent character per block
Currency Symbols
| Value | Count | Frequency (%) |
| € | 600240 |
ASCII
| Value | Count | Frequency (%) |
| - | 304327 | |
| 1 | 214923 | |
| 2 | 171434 | |
| 5 | 144512 | |
| 0 | 142594 | |
| 3 | 103178 | 7.7% |
| 4 | 62716 | 4.7% |
| 6 | 48522 | 3.6% |
| 8 | 46647 | 3.5% |
| 7 | 46198 | 3.4% |
| Other values (5) | 59277 | 4.4% |
None
| Value | Count | Frequency (%) |
| 8414 |
| Distinct | 745 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 448050 |
| Missing (%) | 41.4% |
| Memory size | 59.2 MiB |
| Lunch, Dinner | |
|---|---|
| Dinner | |
| Breakfast, Lunch, Dinner | |
| Lunch, Dinner, After-hours | |
| Dinner, Lunch | |
| Other values (740) |
Length
| Max length | 53 |
|---|---|
| Median length | 13 |
| Mean length | 18.11040738 |
| Min length | 5 |
Characters and Unicode
| Total characters | 11506393 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 161 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Lunch, Dinner |
|---|---|
| 2nd row | Dinner, Lunch, Drinks |
| 3rd row | Lunch, Dinner |
| 4th row | Lunch, Dinner |
| 5th row | Lunch, Dinner, Drinks |
Common Values
| Value | Count | Frequency (%) |
| Lunch, Dinner | 196123 | |
| Dinner | 67459 | 6.2% |
| Breakfast, Lunch, Dinner | 51749 | 4.8% |
| Lunch, Dinner, After-hours | 31493 | 2.9% |
| Dinner, Lunch | 27103 | 2.5% |
| Lunch, Dinner, Drinks | 23327 | 2.2% |
| Lunch | 23305 | 2.2% |
| Breakfast | 13608 | 1.3% |
| Breakfast, Lunch | 13328 | 1.2% |
| Breakfast, Lunch, Brunch | 11457 | 1.1% |
| Other values (735) | 176395 | 16.3% |
| (Missing) | 448050 |
Length
| Value | Count | Frequency (%) |
| dinner | 532366 | |
| lunch | 511678 | |
| breakfast | 181695 | 11.8% |
| drinks | 117450 | 7.6% |
| brunch | 101571 | 6.6% |
| after-hours | 91200 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1795431 | |
| r | 1115482 | |
| , | 900613 | 7.8% |
| 900613 | 7.8% | |
| e | 805261 | 7.0% |
| u | 704449 | 6.1% |
| h | 704449 | 6.1% |
| D | 649816 | 5.6% |
| i | 649816 | 5.6% |
| c | 613249 | 5.3% |
| Other values (10) | 2667214 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8078007 | |
| Uppercase Letter | 1535960 | 13.3% |
| Other Punctuation | 900613 | 7.8% |
| Space Separator | 900613 | 7.8% |
| Dash Punctuation | 91200 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1795431 | |
| r | 1115482 | |
| e | 805261 | |
| u | 704449 | 8.7% |
| h | 704449 | 8.7% |
| i | 649816 | 8.0% |
| c | 613249 | 7.6% |
| s | 390345 | 4.8% |
| a | 363390 | 4.5% |
| k | 299145 | 3.7% |
| Other values (3) | 636990 | 7.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 649816 | |
| L | 511678 | |
| B | 283266 | |
| A | 91200 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 900613 |
Space Separator
| Value | Count | Frequency (%) |
| 900613 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 91200 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9613967 | |
| Common | 1892426 | 16.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1795431 | |
| r | 1115482 | |
| e | 805261 | |
| u | 704449 | 7.3% |
| h | 704449 | 7.3% |
| D | 649816 | 6.8% |
| i | 649816 | 6.8% |
| c | 613249 | 6.4% |
| L | 511678 | 5.3% |
| s | 390345 | 4.1% |
| Other values (7) | 1673991 |
Common
| Value | Count | Frequency (%) |
| , | 900613 | |
| 900613 | ||
| - | 91200 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11506393 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 1795431 | |
| r | 1115482 | |
| , | 900613 | 7.8% |
| 900613 | 7.8% | |
| e | 805261 | 7.0% |
| u | 704449 | 6.1% |
| h | 704449 | 6.1% |
| D | 649816 | 5.6% |
| i | 649816 | 5.6% |
| c | 613249 | 5.3% |
| Other values (10) | 2667214 |
| Distinct | 97741 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 169103 |
| Missing (%) | 15.6% |
| Memory size | 73.3 MiB |
| Italian | 53243 |
|---|---|
| French | 39103 |
| Cafe | 35009 |
| Spanish | 27339 |
| Italian, Pizza | 26998 |
| Other values (97736) |
Length
| Max length | 185 |
|---|---|
| Median length | 17 |
| Mean length | 21.11882392 |
| Min length | 3 |
Characters and Unicode
| Total characters | 19308814 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 73786 ? |
|---|---|
| Unique (%) | 8.1% |
Sample
| 1st row | French |
|---|---|
| 2nd row | French, European |
| 3rd row | French |
| 4th row | French |
| 5th row | French |
Common Values
| Value | Count | Frequency (%) |
| Italian | 53243 | 4.9% |
| French | 39103 | 3.6% |
| Cafe | 35009 | 3.2% |
| Spanish | 27339 | 2.5% |
| Italian, Pizza | 26998 | 2.5% |
| French, European | 14323 | 1.3% |
| Fast food | 13803 | 1.3% |
| Bar, British, Pub | 13703 | 1.3% |
| Pizza | 13440 | 1.2% |
| German | 13244 | 1.2% |
| Other values (97731) | 664089 | |
| (Missing) | 169103 | 15.6% |
Length
| Value | Count | Frequency (%) |
| italian | 235823 | 9.9% |
| european | 234759 | 9.8% |
| mediterranean | 173020 | 7.2% |
| pizza | 114070 | 4.8% |
| cafe | 109188 | 4.6% |
| bar | 107990 | 4.5% |
| french | 98480 | 4.1% |
| spanish | 93191 | 3.9% |
| pub | 91258 | 3.8% |
| seafood | 81397 | 3.4% |
| Other values (155) | 1049222 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2358346 | |
| e | 1755284 | 9.1% |
| n | 1688174 | 8.7% |
| 1474104 | 7.6% | |
| , | 1318639 | 6.8% |
| r | 1313379 | 6.8% |
| i | 1290115 | 6.7% |
| t | 930201 | 4.8% |
| o | 771444 | 4.0% |
| u | 553163 | 2.9% |
| Other values (43) | 5855965 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14116785 | |
| Uppercase Letter | 2358757 | 12.2% |
| Space Separator | 1474104 | 7.6% |
| Other Punctuation | 1319146 | 6.8% |
| Dash Punctuation | 40022 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2358346 | |
| e | 1755284 | |
| n | 1688174 | |
| r | 1313379 | |
| i | 1290115 | |
| t | 930201 | 6.6% |
| o | 771444 | 5.5% |
| u | 553163 | 3.9% |
| s | 530744 | 3.8% |
| l | 496678 | 3.5% |
| Other values (15) | 2429257 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 339595 | |
| S | 277265 | |
| E | 255737 | |
| P | 242532 | |
| B | 236992 | |
| C | 213756 | |
| M | 196285 | |
| F | 182448 | |
| A | 110350 | 4.7% |
| G | 104079 | 4.4% |
| Other values (14) | 199718 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1318639 | |
| & | 507 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1474104 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 40022 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16475542 | |
| Common | 2833272 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2358346 | |
| e | 1755284 | 10.7% |
| n | 1688174 | 10.2% |
| r | 1313379 | 8.0% |
| i | 1290115 | 7.8% |
| t | 930201 | 5.6% |
| o | 771444 | 4.7% |
| u | 553163 | 3.4% |
| s | 530744 | 3.2% |
| l | 496678 | 3.0% |
| Other values (39) | 4788014 |
Common
| Value | Count | Frequency (%) |
| 1474104 | ||
| , | 1318639 | |
| - | 40022 | 1.4% |
| & | 507 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19308814 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2358346 | |
| e | 1755284 | 9.1% |
| n | 1688174 | 8.7% |
| 1474104 | 7.6% | |
| , | 1318639 | 6.8% |
| r | 1313379 | 6.8% |
| i | 1290115 | 6.7% |
| t | 930201 | 4.8% |
| o | 771444 | 4.0% |
| u | 553163 | 2.9% |
| Other values (43) | 5855965 |
| Distinct | 68 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 743141 |
| Missing (%) | 68.6% |
| Memory size | 51.5 MiB |
| Vegetarian Friendly | |
|---|---|
| Vegetarian Friendly, Vegan Options, Gluten Free Options | |
| Vegetarian Friendly, Vegan Options | |
| Vegetarian Friendly, Gluten Free Options | |
| Gluten Free Options | 9898 |
| Other values (63) |
Length
| Max length | 70 |
|---|---|
| Median length | 19 |
| Mean length | 31.78660773 |
| Min length | 5 |
Characters and Unicode
| Total characters | 10815584 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Vegetarian Friendly |
|---|---|
| 2nd row | Vegetarian Friendly |
| 3rd row | Vegetarian Friendly |
| 4th row | Vegetarian Friendly |
| 5th row | Vegetarian Friendly |
Common Values
| Value | Count | Frequency (%) |
| Vegetarian Friendly | 156652 | 14.5% |
| Vegetarian Friendly, Vegan Options, Gluten Free Options | 71379 | 6.6% |
| Vegetarian Friendly, Vegan Options | 49606 | 4.6% |
| Vegetarian Friendly, Gluten Free Options | 32205 | 3.0% |
| Gluten Free Options | 9898 | 0.9% |
| Vegetarian Friendly, Gluten Free Options, Vegan Options | 3875 | 0.4% |
| Vegan Options | 3660 | 0.3% |
| Vegan Options, Vegetarian Friendly | 2034 | 0.2% |
| Halal | 1730 | 0.2% |
| Vegetarian Friendly, Vegan Options, Halal, Gluten Free Options | 1706 | 0.2% |
| Other values (58) | 7511 | 0.7% |
| (Missing) | 743141 |
Length
| Value | Count | Frequency (%) |
| vegetarian | 324017 | |
| friendly | 324017 | |
| options | 260094 | |
| vegan | 136597 | |
| gluten | 123497 | 9.5% |
| free | 123497 | 9.5% |
| halal | 6709 | 0.5% |
| kosher | 298 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1479437 | |
| n | 1168222 | |
| 958470 | 8.9% | |
| i | 908128 | 8.4% |
| a | 798049 | 7.4% |
| r | 771829 | 7.1% |
| t | 707608 | 6.5% |
| l | 460932 | 4.3% |
| V | 460614 | 4.3% |
| g | 460614 | 4.3% |
| Other values (13) | 2641681 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8307526 | |
| Uppercase Letter | 1298726 | 12.0% |
| Space Separator | 958470 | 8.9% |
| Other Punctuation | 250862 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1479437 | |
| n | 1168222 | |
| i | 908128 | |
| a | 798049 | |
| r | 771829 | |
| t | 707608 | |
| l | 460932 | 5.5% |
| g | 460614 | 5.5% |
| d | 324017 | 3.9% |
| y | 324017 | 3.9% |
| Other values (5) | 904673 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 460614 | |
| F | 447514 | |
| O | 260094 | |
| G | 123497 | 9.5% |
| H | 6709 | 0.5% |
| K | 298 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 958470 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 250862 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9606252 | |
| Common | 1209332 | 11.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1479437 | |
| n | 1168222 | |
| i | 908128 | |
| a | 798049 | |
| r | 771829 | 8.0% |
| t | 707608 | 7.4% |
| l | 460932 | 4.8% |
| V | 460614 | 4.8% |
| g | 460614 | 4.8% |
| F | 447514 | 4.7% |
| Other values (11) | 1943305 |
Common
| Value | Count | Frequency (%) |
| 958470 | ||
| , | 250862 | 20.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10815584 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1479437 | |
| n | 1168222 | |
| 958470 | 8.9% | |
| i | 908128 | 8.4% |
| a | 798049 | 7.4% |
| r | 771829 | 7.1% |
| t | 707608 | 6.5% |
| l | 460932 | 4.3% |
| V | 460614 | 4.3% |
| g | 460614 | 4.3% |
| Other values (13) | 2641681 |
| Distinct | 56453 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 765990 |
| Missing (%) | 70.7% |
| Memory size | 61.5 MiB |
| Reservations | |
|---|---|
| Reservations, Seating, Table Service | 15193 |
| Takeout | 7290 |
| Reservations, Seating, Serves Alcohol, Table Service | 7181 |
| Wheelchair Accessible | 5863 |
| Other values (56448) |
Length
| Max length | 588 |
|---|---|
| Median length | 52 |
| Mean length | 68.95317684 |
| Min length | 4 |
Characters and Unicode
| Total characters | 21886221 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 48360 ? |
|---|---|
| Unique (%) | 15.2% |
Sample
| 1st row | Reservations, Seating, Wheelchair Accessible, Serves Alcohol, Accepts Credit Cards, Table Service |
|---|---|
| 2nd row | Reservations, Seating, Table Service, Wheelchair Accessible |
| 3rd row | Reservations, Seating, Serves Alcohol, Table Service, Wheelchair Accessible |
| 4th row | Reservations, Seating, Wheelchair Accessible, Table Service |
| 5th row | Reservations, Seating, Table Service, Serves Alcohol |
Common Values
| Value | Count | Frequency (%) |
| Reservations | 36514 | 3.4% |
| Reservations, Seating, Table Service | 15193 | 1.4% |
| Takeout | 7290 | 0.7% |
| Reservations, Seating, Serves Alcohol, Table Service | 7181 | 0.7% |
| Wheelchair Accessible | 5863 | 0.5% |
| Seating | 5815 | 0.5% |
| Takeout, Wheelchair Accessible | 5591 | 0.5% |
| Reservations, Seating, Wheelchair Accessible, Table Service | 5352 | 0.5% |
| Seating, Table Service | 5097 | 0.5% |
| Reservations, Seating, Wheelchair Accessible, Serves Alcohol, Table Service | 4446 | 0.4% |
| Other values (56443) | 219065 | 20.2% |
| (Missing) | 765990 |
Length
| Value | Count | Frequency (%) |
| seating | 302951 | 12.0% |
| reservations | 215387 | 8.5% |
| table | 191467 | 7.6% |
| service | 191467 | 7.6% |
| wheelchair | 146385 | 5.8% |
| accessible | 146385 | 5.8% |
| alcohol | 129553 | 5.1% |
| serves | 129553 | 5.1% |
| accepts | 101329 | 4.0% |
| takeout | 94983 | 3.7% |
| Other values (49) | 885559 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2885400 | |
| 2217612 | 10.1% | |
| i | 1576833 | 7.2% |
| a | 1501036 | 6.9% |
| r | 1241264 | 5.7% |
| s | 1187791 | 5.4% |
| , | 1187317 | 5.4% |
| l | 1092206 | 5.0% |
| c | 1046080 | 4.8% |
| t | 976838 | 4.5% |
| Other values (36) | 6973844 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15987258 | |
| Uppercase Letter | 2479083 | 11.3% |
| Space Separator | 2217612 | 10.1% |
| Other Punctuation | 1187317 | 5.4% |
| Dash Punctuation | 14951 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2885400 | |
| i | 1576833 | |
| a | 1501036 | |
| r | 1241264 | |
| s | 1187791 | 7.4% |
| l | 1092206 | 6.8% |
| c | 1046080 | 6.5% |
| t | 976838 | 6.1% |
| o | 763497 | 4.8% |
| n | 700102 | 4.4% |
| Other values (13) | 3016211 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 638443 | |
| A | 471242 | |
| T | 304790 | |
| R | 215387 | 8.7% |
| W | 215063 | 8.7% |
| C | 128391 | 5.2% |
| F | 112828 | 4.6% |
| B | 81191 | 3.3% |
| O | 81052 | 3.3% |
| P | 67562 | 2.7% |
| Other values (10) | 163134 | 6.6% |
Space Separator
| Value | Count | Frequency (%) |
| 2217612 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1187317 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14951 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18466341 | |
| Common | 3419880 | 15.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2885400 | |
| i | 1576833 | 8.5% |
| a | 1501036 | 8.1% |
| r | 1241264 | 6.7% |
| s | 1187791 | 6.4% |
| l | 1092206 | 5.9% |
| c | 1046080 | 5.7% |
| t | 976838 | 5.3% |
| o | 763497 | 4.1% |
| n | 700102 | 3.8% |
| Other values (33) | 5495294 |
Common
| Value | Count | Frequency (%) |
| 2217612 | ||
| , | 1187317 | |
| - | 14951 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21886221 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2885400 | |
| 2217612 | 10.1% | |
| i | 1576833 | 7.2% |
| a | 1501036 | 6.9% |
| r | 1241264 | 5.7% |
| s | 1187791 | 5.4% |
| , | 1187317 | 5.4% |
| l | 1092206 | 5.0% |
| c | 1046080 | 4.8% |
| t | 976838 | 4.5% |
| Other values (36) | 6973844 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 759380 | |
| True | 324017 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 946800 | |
| True | 136597 | 12.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 959900 | |
| True | 123497 | 11.4% |
| Distinct | 237890 |
|---|---|
| Distinct (%) | 40.1% |
| Missing | 489565 |
| Missing (%) | 45.2% |
| Memory size | 148.5 MiB |
| {"Mon": ["00:00-23:59"], "Tue": ["00:00-23:59"], "Wed": ["00:00-23:59"], "Thu": ["00:00-23:59"], "Fri": ["00:00-23:59"], "Sat": ["00:00-23:59"], "Sun": ["00:00-23:59"]} | 7674 |
|---|---|
| {"Mon": ["11:00-23:00"], "Tue": ["11:00-23:00"], "Wed": ["11:00-23:00"], "Thu": ["11:00-23:00"], "Fri": ["11:00-23:00"], "Sat": ["11:00-23:00"], "Sun": ["11:00-23:00"]} | 5303 |
| {"Mon": ["12:00-22:00"], "Tue": ["12:00-22:00"], "Wed": ["12:00-22:00"], "Thu": ["12:00-22:00"], "Fri": ["12:00-22:00"], "Sat": ["12:00-22:00"], "Sun": ["12:00-22:00"]} | 4234 |
| {"Mon": ["12:00-23:00"], "Tue": ["12:00-23:00"], "Wed": ["12:00-23:00"], "Thu": ["12:00-23:00"], "Fri": ["12:00-23:00"], "Sat": ["12:00-23:00"], "Sun": ["12:00-23:00"]} | 3677 |
| {"Mon": ["12:00-00:00"], "Tue": ["12:00-00:00"], "Wed": ["12:00-00:00"], "Thu": ["12:00-00:00"], "Fri": ["12:00-00:00"], "Sat": ["12:00-00:00"], "Sun": ["12:00-00:00"]} | 3673 |
| Other values (237885) |
Length
| Max length | 292 |
|---|---|
| Median length | 168 |
| Mean length | 178.8071508 |
| Min length | 90 |
Characters and Unicode
| Total characters | 106181408 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 198640 ? |
|---|---|
| Unique (%) | 33.5% |
Sample
| 1st row | {"Mon": ["09:00-14:30"], "Tue": ["09:00-14:30", "19:00-21:30"], "Wed": ["09:00-14:30", "19:00-21:30"], "Thu": ["09:00-14:30", "19:00-21:30"], "Fri": ["09:00-14:30", "19:00-22:00"], "Sat": ["09:00-14:30", "19:00-22:00"], "Sun": ["09:00-16:00"]} |
|---|---|
| 2nd row | {"Mon": [], "Tue": [], "Wed": ["12:00-14:30", "18:30-22:00"], "Thu": ["12:00-14:30", "18:30-22:00"], "Fri": ["12:00-14:30", "18:30-22:00"], "Sat": ["12:00-14:30", "18:30-22:00"], "Sun": ["12:00-14:30", "18:30-22:00"]} |
| 3rd row | {"Mon": [], "Tue": ["10:00-14:00"], "Wed": ["10:00-14:00"], "Thu": ["10:00-14:00"], "Fri": ["10:00-14:00"], "Sat": ["10:00-14:00"], "Sun": ["10:00-14:00"]} |
| 4th row | {"Mon": [], "Tue": [], "Wed": ["12:00-14:00"], "Thu": ["12:00-14:00"], "Fri": ["12:00-14:00", "19:00-21:00"], "Sat": ["19:00-21:00"], "Sun": ["12:00-14:00", "19:00-21:00"]} |
| 5th row | {"Mon": [], "Tue": ["09:00-16:00"], "Wed": ["09:00-16:00"], "Thu": ["09:00-21:00"], "Fri": ["09:00-21:00"], "Sat": ["16:45-23:45"], "Sun": ["09:00-17:00"]} |
Common Values
| Value | Count | Frequency (%) |
| {"Mon": ["00:00-23:59"], "Tue": ["00:00-23:59"], "Wed": ["00:00-23:59"], "Thu": ["00:00-23:59"], "Fri": ["00:00-23:59"], "Sat": ["00:00-23:59"], "Sun": ["00:00-23:59"]} | 7674 | 0.7% |
| {"Mon": ["11:00-23:00"], "Tue": ["11:00-23:00"], "Wed": ["11:00-23:00"], "Thu": ["11:00-23:00"], "Fri": ["11:00-23:00"], "Sat": ["11:00-23:00"], "Sun": ["11:00-23:00"]} | 5303 | 0.5% |
| {"Mon": ["12:00-22:00"], "Tue": ["12:00-22:00"], "Wed": ["12:00-22:00"], "Thu": ["12:00-22:00"], "Fri": ["12:00-22:00"], "Sat": ["12:00-22:00"], "Sun": ["12:00-22:00"]} | 4234 | 0.4% |
| {"Mon": ["12:00-23:00"], "Tue": ["12:00-23:00"], "Wed": ["12:00-23:00"], "Thu": ["12:00-23:00"], "Fri": ["12:00-23:00"], "Sat": ["12:00-23:00"], "Sun": ["12:00-23:00"]} | 3677 | 0.3% |
| {"Mon": ["12:00-00:00"], "Tue": ["12:00-00:00"], "Wed": ["12:00-00:00"], "Thu": ["12:00-00:00"], "Fri": ["12:00-00:00"], "Sat": ["12:00-00:00"], "Sun": ["12:00-00:00"]} | 3673 | 0.3% |
| {"Mon": ["11:00-22:00"], "Tue": ["11:00-22:00"], "Wed": ["11:00-22:00"], "Thu": ["11:00-22:00"], "Fri": ["11:00-22:00"], "Sat": ["11:00-22:00"], "Sun": ["11:00-22:00"]} | 3299 | 0.3% |
| {"Mon": ["10:00-22:00"], "Tue": ["10:00-22:00"], "Wed": ["10:00-22:00"], "Thu": ["10:00-22:00"], "Fri": ["10:00-22:00"], "Sat": ["10:00-22:00"], "Sun": ["10:00-22:00"]} | 2632 | 0.2% |
| {"Mon": ["11:00-00:00"], "Tue": ["11:00-00:00"], "Wed": ["11:00-00:00"], "Thu": ["11:00-00:00"], "Fri": ["11:00-00:00"], "Sat": ["11:00-00:00"], "Sun": ["11:00-00:00"]} | 2405 | 0.2% |
| {"Mon": ["09:00-00:00"], "Tue": ["09:00-00:00"], "Wed": ["09:00-00:00"], "Thu": ["09:00-00:00"], "Fri": ["09:00-00:00"], "Sat": ["09:00-00:00"], "Sun": ["09:00-00:00"]} | 2329 | 0.2% |
| {"Mon": ["09:00-23:00"], "Tue": ["09:00-23:00"], "Wed": ["09:00-23:00"], "Thu": ["09:00-23:00"], "Fri": ["09:00-23:00"], "Sat": ["09:00-23:00"], "Sun": ["09:00-23:00"]} | 2184 | 0.2% |
| Other values (237880) | 556422 | |
| (Missing) | 489565 |
Length
| Value | Count | Frequency (%) |
| mon | 593832 | 6.5% |
| tue | 593832 | 6.5% |
| wed | 593832 | 6.5% |
| thu | 593832 | 6.5% |
| fri | 593832 | 6.5% |
| sat | 593832 | 6.5% |
| sun | 593832 | 6.5% |
| 399601 | 4.4% | |
| 12:00-15:00 | 111669 | 1.2% |
| 12:00-14:00 | 107585 | 1.2% |
| Other values (5282) | 4312132 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 18744680 | |
| " | 17376420 | |
| : | 13219596 | |
| 8493979 | ||
| 1 | 5962565 | 5.6% |
| - | 4531386 | 4.3% |
| , | 4337155 | 4.1% |
| ] | 4156824 | 3.9% |
| [ | 4156824 | 3.9% |
| 2 | 4138667 | 3.9% |
| Other values (24) | 21063312 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36251088 | |
| Other Punctuation | 34933171 | |
| Space Separator | 8493979 | 8.0% |
| Lowercase Letter | 8313648 | 7.8% |
| Close Punctuation | 4750656 | 4.5% |
| Open Punctuation | 4750656 | 4.5% |
| Dash Punctuation | 4531386 | 4.3% |
| Uppercase Letter | 4156824 | 3.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 18744680 | |
| 1 | 5962565 | 16.4% |
| 2 | 4138667 | 11.4% |
| 3 | 3167626 | 8.7% |
| 9 | 1007849 | 2.8% |
| 8 | 766051 | 2.1% |
| 7 | 716489 | 2.0% |
| 5 | 695343 | 1.9% |
| 4 | 626401 | 1.7% |
| 6 | 425417 | 1.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 1781496 | |
| n | 1187664 | |
| e | 1187664 | |
| h | 593832 | 7.1% |
| r | 593832 | 7.1% |
| i | 593832 | 7.1% |
| d | 593832 | 7.1% |
| a | 593832 | 7.1% |
| t | 593832 | 7.1% |
| o | 593832 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1187664 | |
| T | 1187664 | |
| F | 593832 | |
| W | 593832 | |
| M | 593832 |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 17376420 | |
| : | 13219596 | |
| , | 4337155 | 12.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 4156824 | |
| } | 593832 | 12.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 4156824 | |
| { | 593832 | 12.5% |
Space Separator
| Value | Count | Frequency (%) |
| 8493979 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4531386 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 93710936 | |
| Latin | 12470472 | 11.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 18744680 | |
| " | 17376420 | |
| : | 13219596 | |
| 8493979 | ||
| 1 | 5962565 | 6.4% |
| - | 4531386 | 4.8% |
| , | 4337155 | 4.6% |
| ] | 4156824 | 4.4% |
| [ | 4156824 | 4.4% |
| 2 | 4138667 | 4.4% |
| Other values (9) | 8592840 |
Latin
| Value | Count | Frequency (%) |
| u | 1781496 | |
| S | 1187664 | 9.5% |
| n | 1187664 | 9.5% |
| T | 1187664 | 9.5% |
| e | 1187664 | 9.5% |
| h | 593832 | 4.8% |
| F | 593832 | 4.8% |
| r | 593832 | 4.8% |
| i | 593832 | 4.8% |
| d | 593832 | 4.8% |
| Other values (5) | 2969160 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 106181408 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 18744680 | |
| " | 17376420 | |
| : | 13219596 | |
| 8493979 | ||
| 1 | 5962565 | 5.6% |
| - | 4531386 | 4.3% |
| , | 4337155 | 4.1% |
| ] | 4156824 | 3.9% |
| [ | 4156824 | 3.9% |
| 2 | 4138667 | 3.9% |
| Other values (24) | 21063312 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 489565 |
| Missing (%) | 45.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.327080723 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 6 |
| median | 7 |
| Q3 | 7 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9667343474 |
|---|---|
| Coefficient of variation (CV) | 0.152793111 |
| Kurtosis | 6.671706407 |
| Mean | 6.327080723 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.178990536 |
| Sum | 3757223 |
| Variance | 0.9345752985 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 323729 | |
| 6 | 187868 | 17.3% |
| 5 | 58202 | 5.4% |
| 4 | 10127 | 0.9% |
| 3 | 7406 | 0.7% |
| 2 | 3676 | 0.3% |
| 1 | 2824 | 0.3% |
| (Missing) | 489565 |
| Value | Count | Frequency (%) |
| 1 | 2824 | 0.3% |
| 2 | 3676 | 0.3% |
| 3 | 7406 | 0.7% |
| 4 | 10127 | 0.9% |
| 5 | 58202 | 5.4% |
| 6 | 187868 | |
| 7 | 323729 |
| Value | Count | Frequency (%) |
| 7 | 323729 | |
| 6 | 187868 | |
| 5 | 58202 | 5.4% |
| 4 | 10127 | 0.9% |
| 3 | 7406 | 0.7% |
| 2 | 3676 | 0.3% |
| 1 | 2824 | 0.3% |
| Distinct | 3105 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 489565 |
| Missing (%) | 45.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 62.02328217 |
| Minimum | 0 |
|---|---|
| Maximum | 168 |
| Zeros | 1479 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 39 |
| median | 58.5 |
| Q3 | 81.5 |
| 95-th percentile | 113.5 |
| Maximum | 168 |
| Range | 168 |
| Interquartile range (IQR) | 42.5 |
Descriptive statistics
| Standard deviation | 30.53813355 |
|---|---|
| Coefficient of variation (CV) | 0.4923656485 |
| Kurtosis | 0.8317549838 |
| Mean | 62.02328217 |
| Median Absolute Deviation (MAD) | 20.5 |
| Skewness | 0.7457016458 |
| Sum | 36831409.7 |
| Variance | 932.5776009 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 84 | 19158 | 1.8% |
| 42 | 15362 | 1.4% |
| 77 | 12486 | 1.2% |
| 70 | 12228 | 1.1% |
| 56 | 10711 | 1.0% |
| 63 | 10552 | 1.0% |
| 30 | 9960 | 0.9% |
| 36 | 9705 | 0.9% |
| 91 | 9150 | 0.8% |
| 49 | 9132 | 0.8% |
| Other values (3095) | 475388 | |
| (Missing) | 489565 |
| Value | Count | Frequency (%) |
| 0 | 1479 | |
| 0.01666666667 | 5 | < 0.1% |
| 0.03333333333 | 1 | < 0.1% |
| 0.1166666667 | 3 | < 0.1% |
| 0.1666666667 | 1 | < 0.1% |
| 0.2333333333 | 3 | < 0.1% |
| 0.25 | 443 | < 0.1% |
| 0.5 | 75 | < 0.1% |
| 0.5833333333 | 1 | < 0.1% |
| 0.6666666667 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 168 | 5 | < 0.1% |
| 167.9 | 1 | < 0.1% |
| 167.8833333 | 7703 | |
| 167.7666667 | 21 | < 0.1% |
| 167.65 | 5 | < 0.1% |
| 167.5 | 1 | < 0.1% |
| 167.4166667 | 1 | < 0.1% |
| 167.4 | 3 | < 0.1% |
| 167.1833333 | 2 | < 0.1% |
| 167.0666667 | 1 | < 0.1% |
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 489565 |
| Missing (%) | 45.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.630754153 |
| Minimum | 1 |
|---|---|
| Maximum | 15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 6 |
| median | 7 |
| Q3 | 7 |
| 95-th percentile | 14 |
| Maximum | 15 |
| Range | 14 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2.550127909 |
|---|---|
| Coefficient of variation (CV) | 0.3341908096 |
| Kurtosis | 0.7213945519 |
| Mean | 7.630754153 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.050544374 |
| Sum | 4531386 |
| Variance | 6.50315235 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 269200 | |
| 6 | 120076 | 11.1% |
| 5 | 39570 | 3.7% |
| 12 | 37983 | 3.5% |
| 14 | 30482 | 2.8% |
| 11 | 22390 | 2.1% |
| 10 | 19618 | 1.8% |
| 13 | 11757 | 1.1% |
| 8 | 11331 | 1.0% |
| 9 | 10837 | 1.0% |
| Other values (5) | 20588 | 1.9% |
| (Missing) | 489565 |
| Value | Count | Frequency (%) |
| 1 | 2751 | 0.3% |
| 2 | 3417 | 0.3% |
| 3 | 6051 | 0.6% |
| 4 | 8239 | 0.8% |
| 5 | 39570 | 3.7% |
| 6 | 120076 | |
| 7 | 269200 | |
| 8 | 11331 | 1.0% |
| 9 | 10837 | 1.0% |
| 10 | 19618 | 1.8% |
| Value | Count | Frequency (%) |
| 15 | 130 | < 0.1% |
| 14 | 30482 | 2.8% |
| 13 | 11757 | 1.1% |
| 12 | 37983 | 3.5% |
| 11 | 22390 | 2.1% |
| 10 | 19618 | 1.8% |
| 9 | 10837 | 1.0% |
| 8 | 11331 | 1.0% |
| 7 | 269200 | |
| 6 | 120076 |
avg_rating
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 96636 |
| Missing (%) | 8.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.035942847 |
| Minimum | 1 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4.5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.713694003 |
|---|---|
| Coefficient of variation (CV) | 0.1768345167 |
| Kurtosis | 2.071824575 |
| Mean | 4.035942847 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -1.13780692 |
| Sum | 3982511 |
| Variance | 0.5093591299 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 296979 | |
| 4.5 | 293794 | |
| 3.5 | 147287 | |
| 5 | 128107 | |
| 3 | 70326 | 6.5% |
| 2.5 | 26565 | 2.5% |
| 2 | 13456 | 1.2% |
| 1 | 6381 | 0.6% |
| 1.5 | 3866 | 0.4% |
| (Missing) | 96636 | 8.9% |
| Value | Count | Frequency (%) |
| 1 | 6381 | 0.6% |
| 1.5 | 3866 | 0.4% |
| 2 | 13456 | 1.2% |
| 2.5 | 26565 | 2.5% |
| 3 | 70326 | 6.5% |
| 3.5 | 147287 | |
| 4 | 296979 | |
| 4.5 | 293794 | |
| 5 | 128107 |
| Value | Count | Frequency (%) |
| 5 | 128107 | |
| 4.5 | 293794 | |
| 4 | 296979 | |
| 3.5 | 147287 | |
| 3 | 70326 | 6.5% |
| 2.5 | 26565 | 2.5% |
| 2 | 13456 | 1.2% |
| 1.5 | 3866 | 0.4% |
| 1 | 6381 | 0.6% |
total_reviews_count
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWEDZEROS| Distinct | 3363 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 52235 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102.8889893 |
| Minimum | 0 |
|---|---|
| Maximum | 52404 |
| Zeros | 44149 |
| Zeros (%) | 4.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 24 |
| Q3 | 93 |
| 95-th percentile | 459 |
| Maximum | 52404 |
| Range | 52404 |
| Interquartile range (IQR) | 87 |
Descriptive statistics
| Standard deviation | 267.2414795 |
|---|---|
| Coefficient of variation (CV) | 2.597376855 |
| Kurtosis | 2604.571173 |
| Mean | 102.8889893 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | 25.28240244 |
| Sum | 106095216 |
| Variance | 71418.00837 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 61914 | 5.7% |
| 2 | 46078 | 4.3% |
| 0 | 44149 | 4.1% |
| 3 | 37374 | 3.4% |
| 4 | 32204 | 3.0% |
| 5 | 28324 | 2.6% |
| 6 | 25174 | 2.3% |
| 7 | 22880 | 2.1% |
| 8 | 20737 | 1.9% |
| 9 | 18983 | 1.8% |
| Other values (3353) | 693345 | |
| (Missing) | 52235 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 44149 | |
| 1 | 61914 | |
| 2 | 46078 | |
| 3 | 37374 | |
| 4 | 32204 | |
| 5 | 28324 | |
| 6 | 25174 | |
| 7 | 22880 | 2.1% |
| 8 | 20737 | 1.9% |
| 9 | 18983 | 1.8% |
| Value | Count | Frequency (%) |
| 52404 | 1 | |
| 33731 | 1 | |
| 31144 | 1 | |
| 30142 | 1 | |
| 29273 | 1 | |
| 24671 | 1 | |
| 22364 | 1 | |
| 19856 | 1 | |
| 19167 | 1 | |
| 18971 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 95193 |
| Missing (%) | 8.8% |
| Memory size | 64.9 MiB |
| English | |
|---|---|
| All languages |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 8.81207524 |
| Min length | 7 |
Characters and Unicode
| Total characters | 8708128 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | English |
|---|---|
| 2nd row | All languages |
| 3rd row | English |
| 4th row | English |
| 5th row | All languages |
Common Values
| Value | Count | Frequency (%) |
| English | 689754 | |
| All languages | 298450 | |
| (Missing) | 95193 | 8.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| english | 689754 | |
| all | 298450 | |
| languages | 298450 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1585104 | |
| g | 1286654 | |
| n | 988204 | |
| s | 988204 | |
| E | 689754 | |
| i | 689754 | |
| h | 689754 | |
| a | 596900 | 6.9% |
| A | 298450 | 3.4% |
| 298450 | 3.4% | |
| Other values (2) | 596900 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7421474 | |
| Uppercase Letter | 988204 | 11.3% |
| Space Separator | 298450 | 3.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1585104 | |
| g | 1286654 | |
| n | 988204 | |
| s | 988204 | |
| i | 689754 | |
| h | 689754 | |
| a | 596900 | 8.0% |
| u | 298450 | 4.0% |
| e | 298450 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 689754 | |
| A | 298450 |
Space Separator
| Value | Count | Frequency (%) |
| 298450 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8409678 | |
| Common | 298450 | 3.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1585104 | |
| g | 1286654 | |
| n | 988204 | |
| s | 988204 | |
| E | 689754 | |
| i | 689754 | |
| h | 689754 | |
| a | 596900 | 7.1% |
| A | 298450 | 3.5% |
| u | 298450 | 3.5% |
Common
| Value | Count | Frequency (%) |
| 298450 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8708128 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1585104 | |
| g | 1286654 | |
| n | 988204 | |
| s | 988204 | |
| E | 689754 | |
| i | 689754 | |
| h | 689754 | |
| a | 596900 | 6.9% |
| A | 298450 | 3.4% |
| 298450 | 3.4% | |
| Other values (2) | 596900 | 6.9% |
reviews_count_in_default_language
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 2415 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 95193 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.56341504 |
| Minimum | 1 |
|---|---|
| Maximum | 15229 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 7 |
| Q3 | 26 |
| 95-th percentile | 205 |
| Maximum | 15229 |
| Range | 15228 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 148.7281789 |
|---|---|
| Coefficient of variation (CV) | 3.337450209 |
| Kurtosis | 586.0725255 |
| Mean | 44.56341504 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 14.3963658 |
| Sum | 44037745 |
| Variance | 22120.07119 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 173914 | |
| 2 | 101830 | 9.4% |
| 3 | 70931 | 6.5% |
| 4 | 54144 | 5.0% |
| 5 | 43513 | 4.0% |
| 6 | 35727 | 3.3% |
| 7 | 30477 | 2.8% |
| 8 | 26086 | 2.4% |
| 9 | 22586 | 2.1% |
| 10 | 20019 | 1.8% |
| Other values (2405) | 408977 | |
| (Missing) | 95193 | 8.8% |
| Value | Count | Frequency (%) |
| 1 | 173914 | |
| 2 | 101830 | |
| 3 | 70931 | |
| 4 | 54144 | 5.0% |
| 5 | 43513 | 4.0% |
| 6 | 35727 | 3.3% |
| 7 | 30477 | 2.8% |
| 8 | 26086 | 2.4% |
| 9 | 22586 | 2.1% |
| 10 | 20019 | 1.8% |
| Value | Count | Frequency (%) |
| 15229 | 1 | |
| 14717 | 1 | |
| 13716 | 1 | |
| 11997 | 1 | |
| 9239 | 1 | |
| 8845 | 1 | |
| 8576 | 1 | |
| 8458 | 1 | |
| 8337 | 1 | |
| 8224 | 1 |
excellent
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 1708 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 95193 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.65343998 |
| Minimum | 0 |
|---|---|
| Maximum | 9383 |
| Zeros | 146592 |
| Zeros (%) | 13.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 13 |
| 95-th percentile | 113 |
| Maximum | 9383 |
| Range | 9383 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 89.85080105 |
|---|---|
| Coefficient of variation (CV) | 3.644554315 |
| Kurtosis | 541.9635977 |
| Mean | 24.65343998 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 14.78952065 |
| Sum | 24362628 |
| Variance | 8073.16645 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 186991 | |
| 0 | 146592 | |
| 2 | 102840 | 9.5% |
| 3 | 68452 | 6.3% |
| 4 | 50414 | 4.7% |
| 5 | 39200 | 3.6% |
| 6 | 31461 | 2.9% |
| 7 | 25848 | 2.4% |
| 8 | 21891 | 2.0% |
| 9 | 18898 | 1.7% |
| Other values (1698) | 295617 | |
| (Missing) | 95193 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 146592 | |
| 1 | 186991 | |
| 2 | 102840 | |
| 3 | 68452 | 6.3% |
| 4 | 50414 | 4.7% |
| 5 | 39200 | 3.6% |
| 6 | 31461 | 2.9% |
| 7 | 25848 | 2.4% |
| 8 | 21891 | 2.0% |
| 9 | 18898 | 1.7% |
| Value | Count | Frequency (%) |
| 9383 | 1 | |
| 7558 | 1 | |
| 7282 | 1 | |
| 6813 | 1 | |
| 5369 | 1 | |
| 5110 | 1 | |
| 4912 | 1 | |
| 4841 | 1 | |
| 4790 | 1 | |
| 4767 | 1 |
very_good
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 832 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 95193 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.49051613 |
| Minimum | 0 |
|---|---|
| Maximum | 4091 |
| Zeros | 278879 |
| Zeros (%) | 25.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 6 |
| 95-th percentile | 48 |
| Maximum | 4091 |
| Range | 4091 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 35.51225601 |
|---|---|
| Coefficient of variation (CV) | 3.385177199 |
| Kurtosis | 734.0962925 |
| Mean | 10.49051613 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 15.87200946 |
| Sum | 10366770 |
| Variance | 1261.120327 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 278879 | |
| 1 | 199272 | |
| 2 | 100393 | 9.3% |
| 3 | 63244 | 5.8% |
| 4 | 44246 | 4.1% |
| 5 | 33394 | 3.1% |
| 6 | 26120 | 2.4% |
| 7 | 20995 | 1.9% |
| 8 | 17244 | 1.6% |
| 9 | 14702 | 1.4% |
| Other values (822) | 189715 | |
| (Missing) | 95193 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 278879 | |
| 1 | 199272 | |
| 2 | 100393 | 9.3% |
| 3 | 63244 | 5.8% |
| 4 | 44246 | 4.1% |
| 5 | 33394 | 3.1% |
| 6 | 26120 | 2.4% |
| 7 | 20995 | 1.9% |
| 8 | 17244 | 1.6% |
| 9 | 14702 | 1.4% |
| Value | Count | Frequency (%) |
| 4091 | 1 | |
| 3503 | 1 | |
| 3483 | 1 | |
| 2964 | 1 | |
| 2531 | 1 | |
| 2487 | 1 | |
| 2377 | 1 | |
| 2247 | 1 | |
| 2016 | 1 | |
| 1940 | 1 |
average
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWEDZEROS| Distinct | 458 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 95193 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.10930233 |
| Minimum | 0 |
|---|---|
| Maximum | 2132 |
| Zeros | 493840 |
| Zeros (%) | 45.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 19 |
| Maximum | 2132 |
| Range | 2132 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 15.66963091 |
|---|---|
| Coefficient of variation (CV) | 3.813209555 |
| Kurtosis | 1413.191175 |
| Mean | 4.10930233 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 21.42675175 |
| Sum | 4060829 |
| Variance | 245.5373328 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 493840 | |
| 1 | 178020 | 16.4% |
| 2 | 78321 | 7.2% |
| 3 | 45446 | 4.2% |
| 4 | 30200 | 2.8% |
| 5 | 21577 | 2.0% |
| 6 | 16526 | 1.5% |
| 7 | 13032 | 1.2% |
| 8 | 10435 | 1.0% |
| 9 | 8992 | 0.8% |
| Other values (448) | 91815 | 8.5% |
| (Missing) | 95193 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 493840 | |
| 1 | 178020 | 16.4% |
| 2 | 78321 | 7.2% |
| 3 | 45446 | 4.2% |
| 4 | 30200 | 2.8% |
| 5 | 21577 | 2.0% |
| 6 | 16526 | 1.5% |
| 7 | 13032 | 1.2% |
| 8 | 10435 | 1.0% |
| 9 | 8992 | 0.8% |
| Value | Count | Frequency (%) |
| 2132 | 1 | |
| 2109 | 1 | |
| 1682 | 1 | |
| 1514 | 1 | |
| 1332 | 1 | |
| 1322 | 1 | |
| 1245 | 1 | |
| 1103 | 1 | |
| 1087 | 1 | |
| 1023 | 1 |
poor
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 305 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 95193 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.355306192 |
| Minimum | 0 |
|---|---|
| Maximum | 1253 |
| Zeros | 614652 |
| Zeros (%) | 56.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 11 |
| Maximum | 1253 |
| Range | 1253 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 9.352756334 |
|---|---|
| Coefficient of variation (CV) | 3.970930135 |
| Kurtosis | 1078.095036 |
| Mean | 2.355306192 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.43930313 |
| Sum | 2327523 |
| Variance | 87.47405105 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 614652 | |
| 1 | 147712 | 13.6% |
| 2 | 61499 | 5.7% |
| 3 | 34639 | 3.2% |
| 4 | 22862 | 2.1% |
| 5 | 16294 | 1.5% |
| 6 | 11895 | 1.1% |
| 7 | 9247 | 0.9% |
| 8 | 7596 | 0.7% |
| 9 | 6274 | 0.6% |
| Other values (295) | 55534 | 5.1% |
| (Missing) | 95193 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 614652 | |
| 1 | 147712 | 13.6% |
| 2 | 61499 | 5.7% |
| 3 | 34639 | 3.2% |
| 4 | 22862 | 2.1% |
| 5 | 16294 | 1.5% |
| 6 | 11895 | 1.1% |
| 7 | 9247 | 0.9% |
| 8 | 7596 | 0.7% |
| 9 | 6274 | 0.6% |
| Value | Count | Frequency (%) |
| 1253 | 1 | |
| 1058 | 1 | |
| 991 | 1 | |
| 975 | 1 | |
| 856 | 1 | |
| 666 | 1 | |
| 594 | 1 | |
| 525 | 1 | |
| 516 | 1 | |
| 506 | 1 |
terrible
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 353 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 95193 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.954850416 |
| Minimum | 0 |
|---|---|
| Maximum | 1215 |
| Zeros | 573943 |
| Zeros (%) | 53.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 14 |
| Maximum | 1215 |
| Range | 1215 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 11.03006882 |
|---|---|
| Coefficient of variation (CV) | 3.732868764 |
| Kurtosis | 619.1381812 |
| Mean | 2.954850416 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.37001589 |
| Sum | 2919995 |
| Variance | 121.6624181 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 573943 | |
| 1 | 149446 | 13.8% |
| 2 | 65255 | 6.0% |
| 3 | 39030 | 3.6% |
| 4 | 26198 | 2.4% |
| 5 | 18811 | 1.7% |
| 6 | 14642 | 1.4% |
| 7 | 11719 | 1.1% |
| 8 | 9503 | 0.9% |
| 9 | 7903 | 0.7% |
| Other values (343) | 71754 | 6.6% |
| (Missing) | 95193 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 573943 | |
| 1 | 149446 | 13.8% |
| 2 | 65255 | 6.0% |
| 3 | 39030 | 3.6% |
| 4 | 26198 | 2.4% |
| 5 | 18811 | 1.7% |
| 6 | 14642 | 1.4% |
| 7 | 11719 | 1.1% |
| 8 | 9503 | 0.9% |
| 9 | 7903 | 0.7% |
| Value | Count | Frequency (%) |
| 1215 | 1 | |
| 1059 | 1 | |
| 948 | 1 | |
| 932 | 1 | |
| 725 | 1 | |
| 631 | 1 | |
| 611 | 1 | |
| 592 | 1 | |
| 590 | 1 | |
| 589 | 1 |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 484072 |
| Missing (%) | 44.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.104178868 |
| Minimum | 1 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 4 |
| Q3 | 4.5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.5642075197 |
|---|---|
| Coefficient of variation (CV) | 0.1374714743 |
| Kurtosis | 1.385907272 |
| Mean | 4.104178868 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.9138337666 |
| Sum | 2459737 |
| Variance | 0.3183301253 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.5 | 221346 | |
| 4 | 197437 | |
| 3.5 | 87173 | 8.0% |
| 5 | 49108 | 4.5% |
| 3 | 30317 | 2.8% |
| 2.5 | 9843 | 0.9% |
| 2 | 3251 | 0.3% |
| 1.5 | 752 | 0.1% |
| 1 | 98 | < 0.1% |
| (Missing) | 484072 |
| Value | Count | Frequency (%) |
| 1 | 98 | < 0.1% |
| 1.5 | 752 | 0.1% |
| 2 | 3251 | 0.3% |
| 2.5 | 9843 | 0.9% |
| 3 | 30317 | 2.8% |
| 3.5 | 87173 | 8.0% |
| 4 | 197437 | |
| 4.5 | 221346 | |
| 5 | 49108 | 4.5% |
| Value | Count | Frequency (%) |
| 5 | 49108 | 4.5% |
| 4.5 | 221346 | |
| 4 | 197437 | |
| 3.5 | 87173 | 8.0% |
| 3 | 30317 | 2.8% |
| 2.5 | 9843 | 0.9% |
| 2 | 3251 | 0.3% |
| 1.5 | 752 | 0.1% |
| 1 | 98 | < 0.1% |
service
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 479110 |
| Missing (%) | 44.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.067245365 |
| Minimum | 1 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 4 |
| Q3 | 4.5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.5812667001 |
|---|---|
| Coefficient of variation (CV) | 0.1429140973 |
| Kurtosis | 1.198465749 |
| Mean | 4.067245365 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.8357308358 |
| Sum | 2457783.5 |
| Variance | 0.3378709766 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 202956 | |
| 4.5 | 202637 | |
| 3.5 | 98240 | 9.1% |
| 5 | 49937 | 4.6% |
| 3 | 34711 | 3.2% |
| 2.5 | 10904 | 1.0% |
| 2 | 3771 | 0.3% |
| 1.5 | 1004 | 0.1% |
| 1 | 127 | < 0.1% |
| (Missing) | 479110 |
| Value | Count | Frequency (%) |
| 1 | 127 | < 0.1% |
| 1.5 | 1004 | 0.1% |
| 2 | 3771 | 0.3% |
| 2.5 | 10904 | 1.0% |
| 3 | 34711 | 3.2% |
| 3.5 | 98240 | |
| 4 | 202956 | |
| 4.5 | 202637 | |
| 5 | 49937 | 4.6% |
| Value | Count | Frequency (%) |
| 5 | 49937 | 4.6% |
| 4.5 | 202637 | |
| 4 | 202956 | |
| 3.5 | 98240 | |
| 3 | 34711 | 3.2% |
| 2.5 | 10904 | 1.0% |
| 2 | 3771 | 0.3% |
| 1.5 | 1004 | 0.1% |
| 1 | 127 | < 0.1% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 480705 |
| Missing (%) | 44.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.982896737 |
| Minimum | 1 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4.5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5776048366 |
|---|---|
| Coefficient of variation (CV) | 0.1450212935 |
| Kurtosis | 1.050370969 |
| Mean | 3.982896737 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.7419397672 |
| Sum | 2400460 |
| Variance | 0.3336273473 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 217858 | |
| 4.5 | 172698 | 15.9% |
| 3.5 | 118325 | 10.9% |
| 3 | 41754 | 3.9% |
| 5 | 34268 | 3.2% |
| 2.5 | 12564 | 1.2% |
| 2 | 3995 | 0.4% |
| 1.5 | 1035 | 0.1% |
| 1 | 195 | < 0.1% |
| (Missing) | 480705 |
| Value | Count | Frequency (%) |
| 1 | 195 | < 0.1% |
| 1.5 | 1035 | 0.1% |
| 2 | 3995 | 0.4% |
| 2.5 | 12564 | 1.2% |
| 3 | 41754 | 3.9% |
| 3.5 | 118325 | |
| 4 | 217858 | |
| 4.5 | 172698 | |
| 5 | 34268 | 3.2% |
| Value | Count | Frequency (%) |
| 5 | 34268 | 3.2% |
| 4.5 | 172698 | |
| 4 | 217858 | |
| 3.5 | 118325 | |
| 3 | 41754 | 3.9% |
| 2.5 | 12564 | 1.2% |
| 2 | 3995 | 0.4% |
| 1.5 | 1035 | 0.1% |
| 1 | 195 | < 0.1% |
atmosphere
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 821612 |
| Missing (%) | 75.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.93368222 |
| Minimum | 1 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 3.5 |
| median | 4 |
| Q3 | 4.5 |
| 95-th percentile | 4.5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5552619671 |
|---|---|
| Coefficient of variation (CV) | 0.1411557762 |
| Kurtosis | 0.9718204601 |
| Mean | 3.93368222 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.7228480256 |
| Sum | 1029779 |
| Variance | 0.3083158521 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 99726 | 9.2% |
| 4.5 | 69508 | 6.4% |
| 3.5 | 56092 | 5.2% |
| 3 | 20329 | 1.9% |
| 5 | 8686 | 0.8% |
| 2.5 | 5439 | 0.5% |
| 2 | 1554 | 0.1% |
| 1.5 | 387 | < 0.1% |
| 1 | 64 | < 0.1% |
| (Missing) | 821612 |
| Value | Count | Frequency (%) |
| 1 | 64 | < 0.1% |
| 1.5 | 387 | < 0.1% |
| 2 | 1554 | 0.1% |
| 2.5 | 5439 | 0.5% |
| 3 | 20329 | 1.9% |
| 3.5 | 56092 | |
| 4 | 99726 | |
| 4.5 | 69508 | |
| 5 | 8686 | 0.8% |
| Value | Count | Frequency (%) |
| 5 | 8686 | 0.8% |
| 4.5 | 69508 | |
| 4 | 99726 | |
| 3.5 | 56092 | |
| 3 | 20329 | 1.9% |
| 2.5 | 5439 | 0.5% |
| 2 | 1554 | 0.1% |
| 1.5 | 387 | < 0.1% |
| 1 | 64 | < 0.1% |
| Distinct | 99001 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 984199 |
| Missing (%) | 90.8% |
| Memory size | 40.7 MiB |
| steak, onion loaf, lettuce wedge, chateaubriand, t bone | 7 |
|---|---|
| curry, poppadoms, rice, lamb, best indian | 6 |
| curry, poppadoms, rice, lamb, prawns | 6 |
| curry, rice, naan, lamb, prawns | 5 |
| curry, poppadoms, chicken, indian food, best indian | 5 |
| Other values (98996) |
Length
| Max length | 125 |
|---|---|
| Median length | 54 |
| Mean length | 55.35753745 |
| Min length | 25 |
Characters and Unicode
| Total characters | 5491357 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 98859 ? |
|---|---|
| Unique (%) | 99.7% |
Sample
| 1st row | pizza, tartiflette, fondue, service was excellent, chef |
|---|---|
| 2nd row | tuna, fish, tapas, vineyard, region |
| 3rd row | visited cafe, food was fantastic, both occasions, mascarpone, courses |
| 4th row | pates, apple tart, main course, wine list, set menu |
| 5th row | lunch, large trees, fantastic value, courses, euros |
Common Values
| Value | Count | Frequency (%) |
| steak, onion loaf, lettuce wedge, chateaubriand, t bone | 7 | < 0.1% |
| curry, poppadoms, rice, lamb, best indian | 6 | < 0.1% |
| curry, poppadoms, rice, lamb, prawns | 6 | < 0.1% |
| curry, rice, naan, lamb, prawns | 5 | < 0.1% |
| curry, poppadoms, chicken, indian food, best indian | 5 | < 0.1% |
| curry, poppadoms, rice, lamb, bread | 5 | < 0.1% |
| curry, poppadoms, rice, onion bhaji, lamb | 5 | < 0.1% |
| curry, poppadoms, chicken, onion bhaji, best indian | 5 | < 0.1% |
| curry, poppadoms, onion bhaji, rice, lamb | 4 | < 0.1% |
| steak, onion loaf, lettuce wedge, chateaubriand, calamari | 4 | < 0.1% |
| Other values (98991) | 99146 | 9.2% |
| (Missing) | 984199 |
Length
| Value | Count | Frequency (%) |
| food | 18167 | 2.3% |
| steak | 13671 | 1.7% |
| salad | 13237 | 1.7% |
| and | 11514 | 1.4% |
| fish | 11263 | 1.4% |
| chicken | 10558 | 1.3% |
| great | 10258 | 1.3% |
| bread | 9867 | 1.2% |
| burger | 9278 | 1.2% |
| pizza | 9135 | 1.1% |
| Other values (14991) | 679867 |
Most occurring characters
| Value | Count | Frequency (%) |
| 697617 | ||
| e | 488399 | 8.9% |
| a | 469944 | 8.6% |
| , | 396792 | 7.2% |
| s | 350816 | 6.4% |
| i | 294897 | 5.4% |
| r | 293001 | 5.3% |
| t | 270768 | 4.9% |
| o | 266317 | 4.8% |
| n | 239836 | 4.4% |
| Other values (32) | 1722970 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4396492 | |
| Space Separator | 697617 | 12.7% |
| Other Punctuation | 397202 | 7.2% |
| Decimal Number | 46 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 488399 | 11.1% |
| a | 469944 | 10.7% |
| s | 350816 | 8.0% |
| i | 294897 | 6.7% |
| r | 293001 | 6.7% |
| t | 270768 | 6.2% |
| o | 266317 | 6.1% |
| n | 239836 | 5.5% |
| c | 217262 | 4.9% |
| l | 198060 | 4.5% |
| Other values (19) | 1307192 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 8 | |
| 5 | 6 | |
| 1 | 6 | |
| 9 | 5 | |
| 3 | 5 | |
| 8 | 4 | |
| 0 | 4 | |
| 6 | 3 | 6.5% |
| 7 | 3 | 6.5% |
| 4 | 2 | 4.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 396792 | |
| ' | 410 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 697617 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4396492 | |
| Common | 1094865 | 19.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 488399 | 11.1% |
| a | 469944 | 10.7% |
| s | 350816 | 8.0% |
| i | 294897 | 6.7% |
| r | 293001 | 6.7% |
| t | 270768 | 6.2% |
| o | 266317 | 6.1% |
| n | 239836 | 5.5% |
| c | 217262 | 4.9% |
| l | 198060 | 4.5% |
| Other values (19) | 1307192 |
Common
| Value | Count | Frequency (%) |
| 697617 | ||
| , | 396792 | |
| ' | 410 | < 0.1% |
| 2 | 8 | < 0.1% |
| 5 | 6 | < 0.1% |
| 1 | 6 | < 0.1% |
| 9 | 5 | < 0.1% |
| 3 | 5 | < 0.1% |
| 8 | 4 | < 0.1% |
| 0 | 4 | < 0.1% |
| Other values (3) | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5490861 | |
| None | 496 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 697617 | ||
| e | 488399 | 8.9% |
| a | 469944 | 8.6% |
| , | 396792 | 7.2% |
| s | 350816 | 6.4% |
| i | 294897 | 5.4% |
| r | 293001 | 5.3% |
| t | 270768 | 4.9% |
| o | 266317 | 4.9% |
| n | 239836 | 4.4% |
| Other values (29) | 1722474 |
None
| Value | Count | Frequency (%) |
| é | 271 | |
| û | 215 | |
| â | 10 | 2.0% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| restaurant_link | restaurant_name | original_location | country | region | province | city | address | latitude | longitude | claimed | awards | popularity_detailed | popularity_generic | top_tags | price_level | price_range | meals | cuisines | special_diets | features | vegetarian_friendly | vegan_options | gluten_free | original_open_hours | open_days_per_week | open_hours_per_week | working_shifts_per_week | avg_rating | total_reviews_count | default_language | reviews_count_in_default_language | excellent | very_good | average | poor | terrible | food | service | value | atmosphere | keywords | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | g10001637-d10002227 | Le 147 | ["Europe", "France", "Nouvelle-Aquitaine", "Haute-Vienne", "Saint-Jouvent"] | France | Nouvelle-Aquitaine | Haute-Vienne | Saint-Jouvent | 10 Maison Neuve, 87510 Saint-Jouvent France | 45.961674 | 1.169131 | Claimed | NaN | #1 of 2 Restaurants in Saint-Jouvent | #1 of 2 places to eat in Saint-Jouvent | Cheap Eats, French | € | NaN | Lunch, Dinner | French | NaN | Reservations, Seating, Wheelchair Accessible, Serves Alcohol, Accepts Credit Cards, Table Service | N | N | N | NaN | NaN | NaN | NaN | 4.0 | 36.0 | English | 2.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.0 | 4.5 | 4.0 | NaN | NaN |
| 1 | g10001637-d14975787 | Le Saint Jouvent | ["Europe", "France", "Nouvelle-Aquitaine", "Haute-Vienne", "Saint-Jouvent"] | France | Nouvelle-Aquitaine | Haute-Vienne | Saint-Jouvent | 16 Place de l Eglise, 87510 Saint-Jouvent France | 45.957040 | 1.205480 | Unclaimed | NaN | #2 of 2 Restaurants in Saint-Jouvent | #2 of 2 places to eat in Saint-Jouvent | Cheap Eats | € | NaN | NaN | NaN | NaN | NaN | N | N | N | NaN | NaN | NaN | NaN | 4.0 | 5.0 | All languages | 5.0 | 2.0 | 2.0 | 1.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN |
| 2 | g10002858-d4586832 | Au Bout du Pont | ["Europe", "France", "Centre-Val de Loire", "Berry", "Indre", "Rivarennes"] | France | Centre-Val de Loire | Berry | Rivarennes | 2 rue des Dames, 36800 Rivarennes France | 46.635895 | 1.386133 | Claimed | NaN | #1 of 1 Restaurant in Rivarennes | #1 of 1 places to eat in Rivarennes | Cheap Eats, French, European | € | NaN | Dinner, Lunch, Drinks | French, European | NaN | Reservations, Seating, Table Service, Wheelchair Accessible | N | N | N | NaN | NaN | NaN | NaN | 5.0 | 13.0 | English | 4.0 | 3.0 | 1.0 | 0.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN |
| 3 | g10002986-d3510044 | Le Relais de Naiade | ["Europe", "France", "Nouvelle-Aquitaine", "Correze", "Lacelle"] | France | Nouvelle-Aquitaine | Correze | Lacelle | 9 avenue Porte de la Correze 19170, 19170 Lacelle France | 45.642610 | 1.824460 | Claimed | NaN | #1 of 1 Restaurant in Lacelle | #1 of 1 places to eat in Lacelle | Cheap Eats, French | € | NaN | Lunch, Dinner | French | NaN | Reservations, Seating, Serves Alcohol, Table Service, Wheelchair Accessible | N | N | N | NaN | NaN | NaN | NaN | 4.0 | 34.0 | English | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.5 | 4.5 | 4.5 | NaN | NaN |
| 4 | g10022428-d9767191 | Relais Du MontSeigne | ["Europe", "France", "Occitanie", "Aveyron", "Saint-Laurent-de-Levezou"] | France | Occitanie | Aveyron | Saint-Laurent-de-Levezou | route du Montseigne, 12620 Saint-Laurent-de-Levezou France | 44.208860 | 2.960470 | Unclaimed | NaN | #1 of 1 Restaurant in Saint-Laurent-de-Levezou | #1 of 1 places to eat in Saint-Laurent-de-Levezou | Mid-range, French | €€-€€€ | NaN | Lunch, Dinner | French | NaN | Reservations, Seating, Wheelchair Accessible, Table Service | N | N | N | NaN | NaN | NaN | NaN | 4.5 | 11.0 | All languages | 11.0 | 4.0 | 7.0 | 0.0 | 0.0 | 0.0 | 4.5 | 4.5 | 4.5 | NaN | NaN |
| 5 | g10029260-d6605477 | L'Auberge Du Vieux Crozet | ["Europe", "France", "Auvergne-Rhone-Alpes", "Loire", "Roanne", "Le Crozet"] | France | Auvergne-Rhone-Alpes | Loire | Le Crozet | 59 place du Puits ancienne adresse le Bourg renommée 59 place du Puits, 42310 Le Crozet, Roanne France | 46.169823 | 3.855819 | Claimed | Travellers' Choice, Certificate of Excellence 2020 | #1 of 1 Restaurant in Le Crozet | #1 of 1 places to eat in Le Crozet | Mid-range, French | €€-€€€ | €14-€29 | Lunch, Dinner, Drinks | French | NaN | NaN | N | N | N | {"Mon": ["09:00-14:30"], "Tue": ["09:00-14:30", "19:00-21:30"], "Wed": ["09:00-14:30", "19:00-21:30"], "Thu": ["09:00-14:30", "19:00-21:30"], "Fri": ["09:00-14:30", "19:00-22:00"], "Sat": ["09:00-14:30", "19:00-22:00"], "Sun": ["09:00-16:00"]} | 7.0 | 53.5 | 12.0 | 4.5 | 64.0 | All languages | 64.0 | 44.0 | 15.0 | 2.0 | 2.0 | 1.0 | 4.5 | 4.5 | 4.5 | NaN | NaN |
| 6 | g10029907-d17781655 | Cafe Restaurant NouLou | ["Europe", "France", "Occitanie", "Aude", "Saint-Denis"] | France | Occitanie | Aude | Saint-Denis | Place de l'Église, 30500 Saint-Denis France | 44.233078 | 4.251449 | Claimed | NaN | #2 of 2 Restaurants in Saint-Denis | #2 of 2 places to eat in Saint-Denis | Mid-range, French, European | €€-€€€ | €8-€17 | Lunch, Dinner | French, European | NaN | NaN | N | N | N | {"Mon": [], "Tue": [], "Wed": ["12:00-14:30", "18:30-22:00"], "Thu": ["12:00-14:30", "18:30-22:00"], "Fri": ["12:00-14:30", "18:30-22:00"], "Sat": ["12:00-14:30", "18:30-22:00"], "Sun": ["12:00-14:30", "18:30-22:00"]} | 5.0 | 30.0 | 10.0 | 4.5 | 24.0 | English | 4.0 | 4.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.5 | 4.5 | 4.5 | NaN | NaN |
| 7 | g10029907-d8079764 | L'entre 2 | ["Europe", "France", "Occitanie", "Aude", "Saint-Denis"] | France | Occitanie | Aude | Saint-Denis | 4 route de Saissac, 11310 Saint-Denis France | 43.360023 | 2.219851 | Claimed | Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017 | #1 of 2 Restaurants in Saint-Denis | #1 of 2 places to eat in Saint-Denis | Mid-range, French, European, Vegetarian Friendly | €€-€€€ | €10-€35 | NaN | French, European | Vegetarian Friendly | NaN | Y | N | N | {"Mon": [], "Tue": ["10:00-14:00"], "Wed": ["10:00-14:00"], "Thu": ["10:00-14:00"], "Fri": ["10:00-14:00"], "Sat": ["10:00-14:00"], "Sun": ["10:00-14:00"]} | 6.0 | 24.0 | 6.0 | 4.5 | 133.0 | English | 13.0 | 9.0 | 3.0 | 1.0 | 0.0 | 0.0 | 4.5 | 4.5 | 4.5 | NaN | NaN |
| 8 | g10036850-d8414223 | Noste Courtiu | ["Europe", "France", "Occitanie", "Ariege", "Orgibet"] | France | Occitanie | Ariege | Orgibet | route des Pyrenees, 09800 Orgibet France | 42.934000 | 0.936559 | Claimed | NaN | #1 of 1 Restaurant in Orgibet | #1 of 1 places to eat in Orgibet | Mid-range, French, Cafe, Deli | €€-€€€ | €12-€26 | Lunch, Dinner, Drinks | French, Cafe, Deli, Contemporary, Gastropub | NaN | NaN | N | N | N | {"Mon": [], "Tue": [], "Wed": ["12:00-14:00"], "Thu": ["12:00-14:00"], "Fri": ["12:00-14:00", "19:00-21:00"], "Sat": ["19:00-21:00"], "Sun": ["12:00-14:00", "19:00-21:00"]} | 5.0 | 14.0 | 7.0 | 5.0 | 39.0 | English | 2.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.5 | 4.5 | 4.5 | NaN | NaN |
| 9 | g10054961-d3387712 | Chez Claudine | ["Europe", "France", "Grand Est", "Vosges", "They-sous-Montfort"] | France | Grand Est | Vosges | They-sous-Montfort | 136 rue de la Petite They, 88800 They-sous-Montfort France | 48.231495 | 5.973734 | Claimed | Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017, Certificate of Excellence 2016, Certificate of Excellence 2015 | #1 of 1 Restaurant in They-sous-Montfort | #1 of 1 places to eat in They-sous-Montfort | Mid-range, French, European, Wine Bar | €€-€€€ | €12-€30 | After-hours, Drinks, Lunch, Dinner, Brunch | French, European, Wine Bar | NaN | NaN | N | N | N | {"Mon": [], "Tue": ["09:00-16:00"], "Wed": ["09:00-16:00"], "Thu": ["09:00-21:00"], "Fri": ["09:00-21:00"], "Sat": ["16:45-23:45"], "Sun": ["09:00-17:00"]} | 6.0 | 53.0 | 6.0 | 4.5 | 244.0 | English | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.5 | 4.5 | 4.5 | 4.5 | NaN |
Last rows
| restaurant_link | restaurant_name | original_location | country | region | province | city | address | latitude | longitude | claimed | awards | popularity_detailed | popularity_generic | top_tags | price_level | price_range | meals | cuisines | special_diets | features | vegetarian_friendly | vegan_options | gluten_free | original_open_hours | open_days_per_week | open_hours_per_week | working_shifts_per_week | avg_rating | total_reviews_count | default_language | reviews_count_in_default_language | excellent | very_good | average | poor | terrible | food | service | value | atmosphere | keywords | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1083387 | g946544-d4753627 | Pensiunea Restaurant Rosenau | ["Europe", "Romania", "Transylvania", "Central Romania", "Brasov County", "Rasnov"] | Romania | Transylvania | Brasov County | Rasnov | Str. Florilor 27, Rasnov 505400 Romania | 45.592760 | 25.462336 | Unclaimed | NaN | #13 of 14 Restaurants in Rasnov | #13 of 15 places to eat in Rasnov | Mid-range, European, Eastern European, Romanian | €€-€€€ | NaN | NaN | European, Romanian, Eastern European | NaN | Seating, Serves Alcohol, Reservations, Table Service | N | N | N | {"Mon": ["10:00-00:00"], "Tue": ["10:00-00:00"], "Wed": ["10:00-00:00"], "Thu": ["10:00-00:00"], "Fri": ["10:00-00:00"], "Sat": ["10:00-00:00"], "Sun": ["10:00-00:00"]} | 7.0 | 98.0 | 7.0 | 3.0 | 61.0 | English | 43.0 | 7.0 | 13.0 | 8.0 | 6.0 | 9.0 | 3.5 | 3.0 | 3.5 | 4.0 | NaN |
| 1083388 | g946544-d7128597 | La Promenada | ["Europe", "Romania", "Transylvania", "Central Romania", "Brasov County", "Rasnov"] | Romania | Transylvania | Brasov County | Rasnov | Str. Teilor nr. 88, Rasnov 505400 Romania | 45.597910 | 25.472660 | Claimed | Travellers' Choice, Certificate of Excellence 2020, Certificate of Excellence 2019, Certificate of Excellence 2018, Certificate of Excellence 2017, Certificate of Excellence 2016 | #1 of 14 Restaurants in Rasnov | #1 of 15 places to eat in Rasnov | Mid-range, Romanian, Vegetarian Friendly, Vegan Options | €€-€€€ | NaN | Breakfast, Lunch, Dinner, Brunch | Romanian | Vegetarian Friendly, Vegan Options, Gluten Free Options | NaN | Y | Y | Y | {"Mon": ["13:00-23:00"], "Tue": ["11:00-23:00"], "Wed": ["11:00-23:00"], "Thu": ["11:00-23:00"], "Fri": ["10:00-23:45"], "Sat": ["10:00-23:45"], "Sun": ["10:00-23:00"]} | 7.0 | 86.5 | 7.0 | 4.0 | 355.0 | English | 266.0 | 159.0 | 48.0 | 25.0 | 11.0 | 23.0 | 4.5 | 4.0 | 4.5 | NaN | goulash, soup, pork, outdoor playground, nice meal |
| 1083389 | g946544-d8490226 | Papazaur | ["Europe", "Romania", "Transylvania", "Central Romania", "Brasov County", "Rasnov"] | Romania | Transylvania | Brasov County | Rasnov | Strada Cet ii, Rasnov 505400 Romania | 45.591618 | 25.474144 | Unclaimed | NaN | #12 of 14 Restaurants in Rasnov | #12 of 15 places to eat in Rasnov | Mid-range, Fast food, European | €€-€€€ | NaN | Lunch, Brunch | Fast food, European | NaN | Seating | N | N | N | NaN | NaN | NaN | NaN | 3.5 | 21.0 | English | 19.0 | 3.0 | 7.0 | 4.0 | 3.0 | 2.0 | 4.0 | 4.5 | 4.0 | NaN | NaN |
| 1083390 | g946544-d8749966 | Intim | ["Europe", "Romania", "Transylvania", "Central Romania", "Brasov County", "Rasnov"] | Romania | Transylvania | Brasov County | Rasnov | Strada Ion Creanga 1, Rasnov 505400 Romania | 45.590786 | 25.463972 | Unclaimed | NaN | #8 of 14 Restaurants in Rasnov | #8 of 15 places to eat in Rasnov | Mid-range, Eastern European, Romanian | €€-€€€ | NaN | Lunch, Dinner, Brunch | Eastern European, Romanian | NaN | Reservations, Wheelchair Accessible, Outdoor Seating, Seating, Table Service | N | N | N | NaN | NaN | NaN | NaN | 4.0 | 22.0 | English | 16.0 | 5.0 | 6.0 | 4.0 | 0.0 | 1.0 | NaN | NaN | NaN | NaN | NaN |
| 1083391 | g9610184-d19807817 | Casa Amicii | ["Europe", "Romania", "Transylvania", "Western Romania", "Hunedoara County", "Uricani"] | Romania | Transylvania | Hunedoara County | Uricani | Aleea Teilor 34, Uricani 336100 Romania | 45.333020 | 23.124910 | Unclaimed | NaN | #1 of 1 Restaurant in Uricani | #1 of 1 places to eat in Uricani | European, Romanian | NaN | NaN | NaN | European, Romanian | NaN | NaN | N | N | N | NaN | NaN | NaN | NaN | 5.0 | 1.0 | All languages | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN |
| 1083392 | g9710275-d10770782 | Complex Popas Pacurari | ["Europe", "Romania", "Northeast Romania", "Iasi County", "Valea Lupului"] | Romania | Northeast Romania | Iasi County | NaN | Soseaua Pacurari, Valea Lupului 707410 Romania | 47.172950 | 27.519110 | Unclaimed | NaN | #1 of 1 Restaurant in Valea Lupului | #1 of 1 places to eat in Valea Lupului | NaN | NaN | NaN | Lunch, Dinner | NaN | NaN | NaN | N | N | N | {"Mon": ["10:00-22:00"], "Tue": ["10:00-22:00"], "Wed": ["10:00-22:00"], "Thu": ["10:00-22:00"], "Fri": ["10:00-22:00"], "Sat": ["10:00-22:00"], "Sun": ["10:00-22:00"]} | 7.0 | 84.0 | 7.0 | 2.5 | 2.0 | English | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | NaN | NaN | NaN | NaN | NaN |
| 1083393 | g9716321-d15026574 | Casa Pastravarului DORIPESCO | ["Europe", "Romania", "Transylvania", "Central Romania", "Brasov County", "Apata"] | Romania | Transylvania | Brasov County | Apata | DN 13 Judetul Kilometrul 33 Maierus, Apata 507005 Romania | 45.904423 | 25.470509 | Claimed | NaN | #1 of 1 Restaurant in Apata | #1 of 1 places to eat in Apata | Mid-range, Eastern European | €€-€€€ | NaN | Breakfast, Lunch, Dinner, Brunch, Drinks | Eastern European | NaN | NaN | N | N | N | {"Mon": ["08:00-22:00"], "Tue": ["08:00-22:00"], "Wed": ["08:00-22:00"], "Thu": ["08:00-22:00"], "Fri": ["08:00-22:00"], "Sat": ["08:00-22:00"], "Sun": ["08:00-22:00"]} | 7.0 | 98.0 | 7.0 | 2.0 | 6.0 | English | 5.0 | 0.0 | 1.0 | 1.0 | 1.0 | 2.0 | NaN | NaN | NaN | NaN | NaN |
| 1083394 | g9722813-d15891057 | Hanul Tentea | ["Europe", "Romania", "Transylvania", "Northwest Romania", "Maramures County", "Sacel"] | Romania | Transylvania | Maramures County | Sacel | DN17C, Sacel Romania | 47.631920 | 24.450910 | Unclaimed | NaN | #1 of 1 Restaurant in Sacel | #1 of 1 places to eat in Sacel | NaN | NaN | NaN | NaN | NaN | NaN | NaN | N | N | N | NaN | NaN | NaN | NaN | 3.0 | 2.0 | English | 2.0 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | NaN | NaN | NaN | NaN | NaN |
| 1083395 | g9726871-d21391722 | Casa Paduraru | ["Europe", "Romania", "Southern Romania", "Arges County", "Maracineni"] | Romania | Southern Romania | Arges County | NaN | Sat. Argeselu Numarul 432, Maracineni 117450 Romania | 44.918950 | 24.867634 | Claimed | NaN | NaN | NaN | Cheap Eats, French, American, Bar | € | €2-€8 | Breakfast, Lunch, Dinner, Brunch, Drinks | French, American, Bar, International, European, Pub, Romanian | NaN | NaN | N | N | N | {"Mon": ["10:00-21:00"], "Tue": ["10:00-21:00"], "Wed": ["10:00-21:00"], "Thu": ["10:00-21:00"], "Fri": ["10:00-21:00"], "Sat": [], "Sun": []} | 5.0 | 55.0 | 5.0 | NaN | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1083396 | g9867250-d14979687 | Pastravaria Alina Sarbi | ["Europe", "Romania", "Transylvania", "Northwest Romania", "Maramures County", "Budesti"] | Romania | Transylvania | Maramures County | Budesti | Str. Principala Nr 166A, Budesti 437071 Romania | 47.752220 | 23.938343 | Unclaimed | NaN | #1 of 1 Restaurant in Budesti | #1 of 1 places to eat in Budesti | Diner | NaN | NaN | NaN | Diner | NaN | NaN | N | N | N | NaN | NaN | NaN | NaN | 1.5 | 3.0 | English | 2.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | NaN | NaN | NaN | NaN | NaN |